SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: KNUTZON, DEBORAH
MUKERJI, PRADIP
HUANG, YUNG-SHENG
THURMOND, JENNIFER
CHAUDHARY, SUNITA
LEONARD, AMANDA

(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS
OF LONG CHAIN POLY-UNSATURATED FATTY ACIDS

(iii) NUMBER OF SEQUENCES: 40

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: LIMBACH AND LIMBACH LLP

(B) STREET: 2001 FERRY BUILDING

(C) CITY: SAN FRANCISCO

(D) STATE: CA

(E) COUNTRY: USA

(F) ZIP: 94111



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: Microsoft Word

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: WARD, MICHAEL R.

(B) REGISTRATION NUMBER: 38,651

(C) REFERENCE/DOCKET NUMBER: CGAB-210

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (415) 433-4150

(B) TELEFAX: (415) 433-8716

(C) TELEX: N/A



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1617 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CGACACTCCT TCCTTCTTCT CACCCGTCCT AGTCCCCTTC AACCCCCCTC TTTGACAAAG

ACAACAAACC ATGGCTGCTG CTCCCAGTGT GAGGACGTTT ACTCGGGCCG AGGTTTTGAA

TGCCGAGGCT CTGAATGAGG GCAAGAAGGA TGCCGAGGCA CCCTTCTTGA TGATCATCGA 

CAACAAGGTG TACGATGTCC GCGAGTTCGT CCCTGATCAT CCCGGTGGAA GTGTGATTCT 

CACGCACGTT GGCAAGGACG GCACTGACGT CTTTGACACT TTTCACCCCG AGGCTGCTTG

GGAGACTCTT GCCAACTTTT ACGTTGGTGA TATTGACGAG AGCGACCGCG ATATCAAGAA 

TGATGACTTT GCGGCCGAGG TCCGCAAGCT GCGTACCTTG TTCCAGTCTC TTGGTTACTA 

CGATTCTTCC AAGGCATACT ACGCCTTCAA GGTCTCGTTC AACCTCTGCA TCTGGGGTTT 

GTCGACGGTC ATTGTGGCCA AGTGGGGCCA GACCTCGACC CTCGCCAACG TGCTCTCGGC 

TGCGCTTTTG GGTCTGTTCT GGCAGCAGTG CGGATGGTTG GCTCACGACT TTTTGCATCA 

CCAGGTCTTC CAGGACCGTT TCTGGGGTGA TCTTTTCGGC GCCTTCTTGG GAGGTGTCTG 

CCAGGGCTTC TCGTCCTCGT GGTGGAAGGA CAAGCACAAC ACTCACCACG CCGCCCCCAA 

CGTCCACGGC GAGGATCCCG ACATTGACAC CCACCCTCTG TTGACCTGGA GTGAGCATGC 

GTTGGAGATG TTCTCGGATG TCCCAGATGA GGAGCTGACC CGCATGTGGT CGCGTTTCAT

GGTCCTGAAC CAGACCTGGT TTTACTTCCC CATTCTCTCG TTTGCCCGTC TCTCCTGGTG 

CCTCCAGTCC ATTCTCTTTG TGCTGCCTAA CGGTCAGGCC CACAAGCCCT CGGGCGCGCG 

TGTGCCCATC TCGTTGGTCG AGCAGCTGTC GCTTGCGATG CACTGGACCT GGTACCTCGC 

CACCATGTTC CTGTTCATCA AGGATCCCGT CAACATGCTG GTGTACTTTT TGGTGTCGCA 

GGCGGTGTGC GGAAACTTGT TGGCGATCGT GTTCTCGCTC AACCACAACG GTATGCCTGT 

GATCTCGAAG GAGGAGGCGG TCGATATGGA TTTCTTCACG AAGCAGATCA TCACGGGTCG

TGATGTCCAC CCGGGTCTAT TTGCCAACTG GTTCACGGGT GGATTGAACT ATCAGATCGA 

GCACCACTTG TTCCCTTCGA TGCCTCGCCA CAACTTTTCA AAGATCCAGC CTGCTGTCGA 

GACCCTGTGC AAAAAGTACA ATGTCCGATA CCACACCACC GGTATGATCG AGGGAACTGC 

AGAGGTCTTT AGCCGTCTGA ACGAGGTCTC CAAGGCTGCC TCCAAGATGG GTAAGGCGCA 

GTAAAAAAAA AAACAAGGAC GTTTTTTTTC GCCAGTGCCT GTGCCTGTGC CTGCTTCCCT 

TGTCAAGTCG AGCGTTTCTG GAAAGGATCG TTCAGTGCAG TATCATCATT CTCCTTTTAC 

CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu
1 5 10 15

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe
20 25 30

Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro
35 40 45

Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly
50 55 60

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu
65 70 75 80

Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
100 105 110

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125

Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140

Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160

Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190

Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220

Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255

Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270

Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285

Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300

Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320

Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335

Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350

Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365

Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
370 375 380

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430

Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445

Ala Ala Ser Lys Met Gly Lys Ala Gln
450 455

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC 60 

CCACCGTCTC TCCTCCACCC TCCGAGACGA CTGCAACTGT AATCAGGAAC CGACAAATAC 120 

ACGATTTCTT TTTACTCAGC ACCAACTCAA AATCCTCAAC CGCAACCCTT TTTCAGGATG 180 

GCACCTCCCA ACACTATCGA TGCCGGTTTG ACCCAGCGTC ATATCAGCAC CTCGGCCCCA 240

AACTCGGCCA AGCCTGCCTT CGAGCGCAAC TACCAGCTCC CCGAGTTCAC CATCAAGGAG 300 

ATCCGAGAGT GCATCCCTGC CCACTGCTTT GAGCGCTCCG GTCTCCGTGG TCTCTGCCAC 360

GTTGCCATCG ATCTGACTTG GGCGTCGCTC TTGTTCCTGG CTGCGACCCA GATCGACAAG 420 

TTTGAGAATC CCTTGATCCG CTATTTGGCC TGGCCTGTTT ACTGGATCAT GCAGGGTATT 480

GTCTGCACCG GTGTCTGGGT GCTGGCTCAC GAGTGTGGTC ATCAGTCCTT CTCGACCTCC 540

AAGACCCTCA ACAACACAGT TGGTTGGATC TTGCACTCGA TGCTCTTGGT CCCCTACCAC 600 

TCCTGGAGAA TCTCGCACTC GAAGCACCAC AAGGCCACTG GCCATATGAC CAAGGACCAG 660 

GTCTTTGTGC CCAAGACCCG CTCCCAGGTT GGCTTGCCTC CCAAGGAGAA CGCTGCTGCT 720 

GCCGTTCAGG AGGAGGACAT GTCCGTGCAC CTGGATGAGG AGGCTCCCAT TGTGACTTTG 780 

TTCTGGATGG TGATCCAGTT CTTGTTCGGA TGGCCCGCGT ACCTGATTAT GAACGCCTCT 840

GGCCAAGACT ACGGCCGCTG GACCTCGCAC TTCCACACGT ACTCGCCCAT CTTTGAGCCC 900

CGCAACTTTT TCGACATTAT TATCTCGGAC CTCGGTGTGT TGGCTGCCCT CGGTGCCCTG 960 

ATCTATGCCT CCATGCAGTT GTCGCTCTTG ACCGTCACCA AGTACTATAT TGTCCCCTAC 1020 

CTCTTTGTCA ACTTTTGGTT GGTCCTGATC ACCTTCTTGC AGCACACCGA TCCCAAGCTG 1080 

CCCCATTACC GCGAGGGTGC CTGGAATTTC CAGCGTGGAG CTCTTTGCAC CGTTGACCGC 1140

TCGTTTGGCA AGTTCTTGGA CCATATGTTC CACGGCATTG TCCACACCCA TGTGGCCCAT 1200 

CACTTGTTCT CGCAAATGCC GTTCTACCAT GCTGAGGAAG CTACCTATCA TCTCAAGAAA 1260

CTGCTGGGAG AGTACTATGT GTACGACCCA TCCCCGATCG TCGTTGCGGT CTGGAGGTCG 1320

TTCCGTGAGT GCCGATTCGT GGAGGATCAG GGAGACGTGG TCTTTTTCAA GAAGTAAAAA 1380 

AAAAGACAAT GGACCACACA CAACCTTGTC TCTACAGACC TACGTATCAT GTAGCCATAC 1440

CACTTCATAA AAGAACATGA GCTCTAGAGG CGTGTCATTC GCGCCTCC 1488
(2) INFORMATION FOR SEQ ID NO: 4:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Met Ala Pro Pro Asn Thr Ile Asp Ala Gly Leu Thr Gln Arg His Ile
1 5 10 15

Ser Thr Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr
20 25 30

Gln Leu Pro Glu Phe Thr Ile Lys Glu Ile Arg Glu Cys Ile Pro Ala
35 40 45

His Cys Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala Ile
50 55 60

Asp Leu Thr Trp Ala Ser Leu Leu Phe Leu Ala Ala Thr Gln Ile Asp
65 70 75 80

Lys Phe Glu Asn Pro Leu Ile Arg Tyr Leu Ala Trp Pro Val Tyr Trp
85 90 95

Ile Met Gln Gly Ile Val Cys Thr Gly Val Trp Val Leu Ala His Glu
100 105 110

Cys Gly His Gln Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val
115 120 125

Gly Trp Ile Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Arg
130 135 140

Ile Ser His Ser Lys His His Lys Ala Thr Gly His Met Thr Lys Asp
145 150 155 160

Gln Val Phe Val Pro Lys Thr Arg Ser Gln Val Gly Leu Pro Pro Lys
165 170 175

Glu Asn Ala Ala Ala Ala Val Gln Glu Glu Asp Met Ser Val His Leu
180 185 190

Asp Glu Glu Ala Pro Ile Val Thr Leu Phe Trp Met Val Ile Gln Phe
195 200 205

Leu Phe Gly Trp Pro Ala Tyr Leu Ile Met Asn Ala Ser Gly Gln Asp
210 215 220

Tyr Gly Arg Trp Thr Ser His Phe His Thr Tyr Ser Pro Ile Phe Glu
225 230 235 240

Pro Arg Asn Phe Phe Asp Ile Ile Ile Ser Asp Leu Gly Val Leu Ala
245 250 255

Ala Leu Gly Ala Leu Ile Tyr Ala Ser Met Gln Leu Ser Leu Leu Thr
260 265 270

Val Thr Lys Tyr Tyr Ile Val Pro Tyr Leu Phe Val Asn Phe Trp Leu
275 280 285

Val Leu Ile Thr Phe Leu Gln His Thr Asp Pro Lys Leu Pro His Tyr
290 295 300

Arg Glu Gly Ala Trp Asn Phe Gln Arg Gly Ala Leu Cys Thr Val Asp
305 310 315 320

Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly Ile Val His
325 330 335

Thr His Val Ala His His Leu Phe Ser Gln Met Pro Phe Tyr His Ala
340 345 350

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val
355 360 365

Tyr Asp Pro Ser Pro Ile Val Val Ala Val Trp Arg Ser Phe Arg Glu
370 375 380

Cys Arg Phe Val Glu Asp Gln Gly Asp Val Val Phe Phe Lys Lys
385 390 395

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Val Arg Lys Leu Arg Thr Leu Phe Gln Ser Leu Gly Tyr Tyr Asp
1 5 10 15

Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys Ile
20 25 30

Trp Gly Leu Ser Thr Val Ile Val Ala Lys Trp Gly Gln Thr Ser Thr
35 40 45

Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gln Gln
50 55 60

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Gln Asp
65 70 75 80

Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gln
85 90 95

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala
100 105 110

Ala Pro Asn Val His Gly Glu Asp Pro Asp Ile Asp Thr His Pro Leu
115 120 125

Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp
130 135 140

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gln Thr
145 150 155 160

Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu
165 170 175

Gln Ser Ile Leu Phe Val Leu Pro Asn Gly Gln Ala His Lys Pro Ser
180 185 190

Gly Ala Arg Val Pro Ile Ser Leu Val Glu Gln Leu Ser Leu Ala Met
195 200 205

His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe Ile Lys Asp Pro
210 215 220

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gln Ala Val Cys Gly Asn
225 230 235 240

Leu Leu Ala Ile Val Phe Ser Leu Asn His Asn Gly Met Pro Val Ile
245 250 255

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gln Ile Ile
260 265 270

Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly
275 280 285

Gly Leu Asn Tyr Gln Ile Glu His His Leu Phe Pro Ser Met Pro Arg
290 295 300

His Asn Phe Ser Lys Ile Gln Pro Ala Val Glu Thr Leu Cys Lys Lys
305 310 315 320

Tyr Asn Val Arg Tyr His Thr Thr Gly Met Ile Glu Gly Thr Ala Glu
325 330 335

Val Phe Ser Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly
340 345 350

Lys Ala Gln
355

(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids
(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



Val Thr Leu Tyr Thr Leu Ala Phe Val Ala Ala Asn Ser Leu Gly Val
1 5 10 15

Leu Tyr Gly Val Leu Ala Cys Pro Ser Val Xaa Pro His Gln Ile Ala
20 25 30

Ala Gly Leu Leu Gly Leu Leu Trp Ile Gln Ser Ala Tyr Ile Gly Xaa
35 40 45

Asp Ser Gly His Tyr Val Ile Met Ser Asn Lys Ser Asn Asn Xaa Phe
50 55 60

Ala Gln Leu Leu Ser Gly Asn Cys Leu Thr Gly Ile Ile Ala Trp Trp
65 70 75 80

Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr
85 90 95

Gly Pro Asn Leu Gln His Ile Pro
100

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gln
1 5 10 15

Ile Ala Ala Ala Leu Leu Gly Leu Leu Trp Ile Gln Ser Ala Tyr Ile
20 25 30

Gly His Asp Ser Gly His Tyr Val Ile Met Ser Asn Lys Ser Tyr Asn
35 40 45

Arg Phe Ala Gln Leu Leu Ser Gly Asn Cys Leu Thr Gly Ile Ser Ile
50 55 60

Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser
65 70 75 80

Leu Asp Tyr Asp Pro Asp Leu Gln His Ile Pro Val Phe Ala Val Ser
85 90 95




WO 98/46763 



PCT/US98/07126 



Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu
100 105 110

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gln His Phe Thr
115 120 125

Tyr Tyr Pro Val Asn Cys Phe Gly Arg Ile Asn Leu Phe Ile Gln Thr
130 135 140

Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn
145 150 155 160

Phe Ala Gly Ile Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser
165 170 175

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe
180 185 190

Thr Val Thr Ala Leu Gln His Ile Gln Phe Thr Leu Asn His Phe Ala
195 200 205

Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys
210 215 220

Gln Ala Ala Gly Thr Ile Asp Ile Ser Cys Arg Ser Tyr Met Asp Trp
225 230 235 240

Phe Phe Gly Gly Leu Gln Phe Gln Leu Glu His His
245 250

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Xaa Xaa Asn Phe Ala Gly Ile Leu Val Phe Trp Thr Trp Phe Pro
1 5 10 15

Leu Leu Val Ser Cys Leu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val
20 25 30

Phe Thr Gly Phe Thr Val Thr Ala Leu Gln His Ile Gln Phe Thr Leu
35 40 45

Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp
50 55 60

Trp Phe Glu Lys Gln Ala Ala Gly Thr Ile Asp Ile Ser Cys Arg Ser
65 70 75 80

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gln Phe Gln Leu Glu His His
85 90 95

Leu Phe Pro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val
100 105 110

Gly Gln Arg Gly Phe Gln Arg Lys Xaa Asn Leu Ser Xaa
115 120 125

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met Ile Thr Phe Tyr Val
1 5 10 15

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu
20 25 30

Gly Leu Phe Phe Ile Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp
35 40 45

Val Thr Gln Met Asn His Ile Pro Met His Ile Asp His Asp Arg Asn
50 55 60

Met Asp Trp Val Ser Thr Gln Leu Gln Ala Thr Cys Asn Val His Lys
65 70 75 80

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu
85 90 95

His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala
100 105 110

Pro Leu Val Gln Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser
115 120 125

Lys Pro Leu
130

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe
1 5 10 15

Ile Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
20 25 30

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val
35 40 45

Lys Glu Trp Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asn Tyr
50 55 60

Phe Val Gly Tyr Asn Leu Asn Leu Gln Gln Leu Lys Asn Met Ala Glu
65 70 75 80

Leu Val Gln Ala Lys Ala Ala
85

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val
1 5 10 15

Cys Met Gln Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg
20 25 30

Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu
35 40 45

Leu Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val Trp Ile
50 55 60

Thr Gln Met Asn His Ile Pro Lys Glu Ile Gly His Glu Lys His Arg
65 70 75 80

Asp Trp Ala Ser Ser Gln Leu Ala Ala Thr Cys Asn Val Glu Pro Ser
85 90 95

Leu Phe Ile Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu His
100 105 110

His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro
115 120 125

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val
130 135 140

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CCAAGCTTCT GCAGGAGCTC ttttTTTTTT TTTTT 35
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 33 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 33 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TACCAACTCG AGAAAATGGC TGCTGCTCCC AGTGTGAGG 39
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
AACTGATCTA GATTACTGCG CCTTACCCAT CTTGGAGGC 39 
(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



TACCAACTCG AGAAAATGGC ACCTCCCAAC ACTATCGAT 39

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
AACTGATCTA GATTACTTCT TGAAAAAGAC CACGTCTCC 39



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 746 nucleic acids
(B) TYPE: nucleic acid
(C) STRANDEDNESS: not relevant
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC 60

CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120

AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180

ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240 

GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300 

CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA 360

GACCAGATGA ACCATCTCAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC 420

AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG 480

TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG 540 

^™™ T TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG 600 

CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA 660

AACCATTAGC CTCAAAAGAT GATTAAATGA AATAACTTAA AAACCAATTA TTTaS™ 720

ACAAACAGTA ATATTAATAA ATACAA



(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Tyr Val Thr Pro Phe Gln Thr Arg Ser Trp Tyr His Lys Tyr Gln
1 5 10 15

His Ile Tyr Ala Pro Leu Leu Tyr Gly Ile Tyr Thr Leu Lys Tyr
20 25 30

Arg Thr Gln Asp Trp Glu Ala Phe Val Lys Asp Gly Lys Asn Gly
35 40 45

Ala Ile Arg Val Ser Val Ala Thr Asn Phe Asp Lys Ala Ala Tyr
50 55 60

Val Ile Gly Lys Leu Ser Phe Val Phe Phe Arg Phe Ile Leu Pro
65 70 75

Leu Arg Tyr His Ser Phe Thr Asp Leu Ile Cys Tyr Phe Leu Ile
80 85 90

Ala Glu Phe Val Phe Gly Trp Tyr Leu Thr Ile Asn Phe Gln Val
95 100 105

Ser His Val Ala Glu Asp Leu Lys Phe Phe Ala Thr Pro Glu Arg
110 115 120

Pro Asp Glu Pro Ser Gln Ile Asn Glu Asp Trp Ala Ile Leu Gln
125 130 135

Leu Lys Thr Thr Gln Asp Tyr Gly His Gly Ser Leu Leu Cys Thr
140 145 150

Phe Phe Ser Gly Ser Leu Asn His Gln Val Val His His Leu Phe
155 160 165

Pro Ser Ile Ala Gln Asp Phe Tyr Pro Gln Leu Val Pro Ile Val
170 175 180

Lys Glu Val Cys Lys Glu His Asn Ile Thr Tyr His Ile Lys Pro
185 190 195

Asn Phe Thr Glu Ala Ile Met Ser His Ile Asn Tyr Leu Tyr Lys
200 205 210

Met Gly Asn Asp Pro Asp Tyr Val Lys Lys Pro Leu Ala Ser Lys
215 220 225

Asp Asp ***



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 nucleic acids

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:



TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 60

CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC 120

TTATTCCCCA GCCTGCCCCG ACACAATCTG GCCAAGACAC ACGCACTGGT CGAATCGTTC 180

TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC 240

TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC 300

GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC 360

ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG 420

GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG 480

GCCCGCGTNA AAGT 494

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



Phe Trp Lys Xxx Pro Ser Xxx Pro Arg Xxx Xxx Gln Val Xxx Gly
1 5 10 15

Ala Glu Xxx Gly Phe Pro Pro Lys Pro Phe Val Asp Trp Phe Cys
20 25 30

Gly Gly Phe Gln Tyr Gln Val Asp His His Leu Phe Pro Ser Leu
35 40 45

Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val Glu Ser Phe
50 55 60

Cys Lys Glu Trp Gly Val Gln Tyr His Glu Ala Asp Leu Val Asp
65 70 75

Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu
80 85 90

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met
95 100









(2) INFORMATION FOR SEQ ID NO: 23:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

[...] GTTTAAGCGT CATGGGTGCG 60

[...] GCGCCTTTGG TCTCGGCTGC 120

[...] TGCCCGTGAG CAACCCGGAG 180

[...] GAACATCAGC ACCAAGTCGT 240

[...] CGAGCACCAC CTTTTCCCCA 300

[...] CGAGGCCCTC TTCAAGCGCC 360

[...] CTCCACCACC TTTGCCAACC 420

[...] CTAGCCTCTT TTCCTAGACC 480

TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC 520

(2) INFORMATION FOR SEQ ID NO:24:
(i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



Met Glu Phe Val Trp Ile Ala Val Arg Tyr Ala Thr Trp Phe Lys
1 5 10 15

Arg His Gly Cys Ala Trp Val His Ala Gly Ala Val Val Gly His
20 25 30

Val Leu Val Arg Leu Trp Ser Arg Leu His Leu His Phe Ser Ala
35 40 45

Val Arg Arg Lys Ser His Pro Phe Ala Arg Glu Gln Pro Gly Gly
50 55 60

Ser Ala Ala Leu Ala Arg Val Arg Ala Asp His Thr Val Asn Ile
65 70 75

Ser Thr Lys Ser Trp Phe Val Thr Trp Trp Met Ser Asn Leu Asn
80 85 90

Phe Gln Ile Glu His His Leu Phe Pro Thr Ala Pro Gln Phe Arg
95 100 105

Phe Lys Glu Ile Ser Pro Arg Val Glu Ala Leu Phe Lys Arg His
110 115 120

Gly Leu Pro Tyr Tyr Asp Met Pro Tyr Thr Ser Ala Val Ser Thr
125 130 135

Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly Asp Ala
140 145 150

Lys Arg Asp



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC 60

GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 120

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 180

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 240

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 300

TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360

AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:



Arg Val Arg Pro Arg Val Arg Arg Glu Gln Leu Ile Lys Glu Gly
1 5 10 15

Tyr Phe Asp Pro Ser Leu Pro His Met Thr Tyr Arg Val Val Glu
20 25 30

Ile Val Val Leu Phe Val Leu Ser Phe Trp Leu Met Gly Gln Ser
35 40 45

Ser Pro Leu Ala Leu Ala Leu Gly Ile Val Val Ser Gly Ile Ser
50 55 60

Gln Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly Ser
65 70 75

Phe Thr Gly Val Ile Trp Leu Asp Asp Arg Leu Cys Glu Phe Phe
80 85 90

Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gln
95 100 105

His Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val
110 115 120

Asp Leu Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val
125 130 135

Arg Lys Val Arg Pro
140

(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1219 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2692004)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

40 GCACGCCGAC CGGCGCCGGG AGATCCTGGC AAAGTATCCA GAGATAAAGT CCTTGATGAA 60 

ACCTGATCCC AATTTGATAT GGATTATAAT TATGATGGTT CTCACCCAGT TGGGTGCATT 120 

45 TTACATAGTA AAAGACTTGG ACTGGAAATG GGTCATATTT GGGGCCTATG CGTTTGGCAG 180 

TTGCATTAAC CACTCAATGA CTCTGGCTAT TCATGAGATT GCCCACAATG CTGCCTTTGG 24 0 

CAACTGCAAA GCAATGTGGA ATCGCTGGTT TGGAATGTTT GCTAATCTTC CTATTGGGAT 300 

50 TCCATATTCA ATTTCCTTTA AGAGGTATCA CATGGATCAT CATCGGTACC TTGGAGCTGA 360 

TGGCGTCGAT GTAGATATTC CTACCGATTT TGAGGGCTGG TTCTTCTGTA CCGCTTTCAG 420 

AAAGTTTATA TGGGTTATTC TTCAGCCTCT CTTTTATGCC TTTCGACCTC TGTTCATCAA 480

CCCCAAACCA ATTACGTATC TGGAAGTTAT CAATACCGTG GCACAGGTCA CTTTTGACAT 540 

TTTAATTTAT TACTTTTTGG GAATTAAATC CTTAGTCTAC ATGTTGGCAG CATCTTTACT 600 

TGGCCTGGGT TTGCACCCAA TTTCTGGACA TTTTATAGCT GAGCATTACA TGTTCTTAAA 660 

GGGTCATGAA ACTTACTCAT ATTATGGGCC TCTGAATTTA CTTACCTTCA ATGTGGGTTA 720 

TCATAATGAA CATCATGATT TCCCCAACAT TCCTGGAAAA AGTCTTCCAC TGGTGAGGAA 780

AATAGCAGCT GAATACTATG ACAACCTCCC TCACTACAAT TCCTGGATAA AAGTACTGTA 840 

TGATTTTGTG ATGGATGATA CAATAAGTCC CTACTCAAGA ATGAAGAGGC ACCAAAAAGG 900 



AGAGATGGTG CTGGAGTAAA TATCATTAGT GCCAAAGGGA TTCTTCTCCA AAACTTTAGA 960

TGATAAAATG GAATTTTTGC ATTATTAAAC TTGAGACCAG TGATGCTCAG AAGCTCCCCT 1020 

GGCACAATTT CAGAGTAAGA GCTCGGTGAT ACCAAGAAGT GAATCTGGCT TTTAAACAGT 1080 

CAGCCTGACT CTGTACTGCT CAGTTTCACT CACAGGAAAC TTGTGACTTG TGTATTATCG 1140

TCATTGAGGA TGTTTCACTC ATGTCTGTCA TTTTATAAGC ATATCATTTA AAAAGCTTCT 1200 

AAAAAGCTAT TTCGCCAGG 1219 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



TTACCTTCTA CGTCCGCTTC TTCCTCACTT ATGTGCCACT ATTGGGGCTG AAAGCTTCCT 60
GGGCCTTTTC TTCATAGTCA GGTTCCTGGA AAGCAACTGG TTTGTGTGGG TGACACAGAT 120
GAACCATATT CCCATGCACA TTGATCATGA CCGGAACATG GACTGGGTTT CCACCCAGCT 180
CCAGGCCACA TGCAATGTCC ACAAGTCTGC CTTCAATGAC TGGTTCAGTG GACACCTCAA 240
CTTCCAGATT GAGCACCATC TTTTTCCCAC GATGCCTCGA CACAATTACC ACAAAGTGGC 300
TCCCCTGGTG CAGTCCTTGT GTGCCAAGCA TGGCATAGAG TACCAGTCCA AGCCCCTGCT 360
GTCAGCCTTC GCCGACATCA TCCACTCACT AAAGGAGTCA GGGCAGCTCT GGCTAGATGC 420
CTATCTTCAC CAATAACAAC AGCCACCCTG CCCAGTCTGG AAGAAGAGGA GGAAGACTCT 480
GGAGCCAAGG CAGAGGGGAG CTTGAGGGAC AATGCCACTA TAGTTTAATA CTCAGAGGGG 540
GTTGGGTTTG GGGACATAAA GCCTCTGACT CAAACTCCTC CCTTTTATCT TCTAGCCACA 600
GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 655



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60
TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120
CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180
AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240
CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300
AAGA 304

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60
GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120
CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180
GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240
CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300
CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360
CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420
TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGGCC 480
CAGGCTGGCT GGCTGCAGCA TGACTTTGGG CACCTGTCGG TCTTCAGCAC CTCAAAGTGG 540
AACCATCTGC TACATCATTT TGTGATTGGC CACCTGAAGG GGGCCCCCGC CAGTTGGTGG 600
AACCACATGC ACTTCCAGCA CCATGCCAAG CCCAACTGCT TCCGCAAAGA CCCAGACATC 660
AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720
AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 780
GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 840
TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900
ACCGCAAATG CTTCTAAA 918











(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

GCCACTTAAA GGGTGCCTCT GCCAACTGGT GGAATCATCG CCACTTCCAG CACCACGCCA 60
AGCCTAACAT CTTCCACAAG GATCCCGATG TGAACATGCT GCACGTGTTT GTTCTGGGCG 120
AATGGCAGCC CATCGAGTAC GGCAAGAAGA AGCTGAAATA CCTGCCCTAC AATCACCAGC 180
ACGAATACTT CTTCCTGATT GGGCCGCCGC TGCTCATCCC CATGTATTTC CAGTACCAGA 240
TCATCATGAC CATGATCGTC CATAAGAACT GGGTGGACCT GGCCTGGGCC GTCAGCTACT 300
ACATCCGGTT CTTCATCACC TACATCCCTT TCTACGGCAT CCTGGGAGCC CTCCTTTTCC 360
TCAACTTCAT CAGGTTCCTG GAGAGCCACT GGTTTGTGTG GGTCACACAG ATGAATCACA 420
TCGTCATGGA GATTGACCAG GAGGCCTACC GTGACTGGTT CAGTAGCCAG CTGACAGCCA 480
CCTGCAACGT GGAGCAGTCC TTCTTCAACG ACTGGTTCAG TGGACACCTT AACTTCCAGA 540
TTGAGCACCA CCTCTTCCCC ACCATGCCCC GGCACAACTT ACACAAGATC GCCCCGCTGG 600
TGAAGTCTCT ATGTGCCAAG CATGGCATTG AATACCAGGA GAAGCCGCTA CTGAGGGCCC 660
TGCTGGACAT CATCAGGTCC CTGAAGAAGT CTGGGAAGCT GTGGCTGGAC GCCTACCTTC 720
ACAAATGAAG CCACAGCCCC CGGGACACCG TGGGGAAGGG GTGCAGGTGG GGTGATGGCC 780
AGAGGAATGA TGGGCTTTTG TTCTGAGGGG TGTCCGAGAG GCTGGTGTAT GCACTGCTCA 840
CGGACCCCAT GTTGGATCTT TCTCCCTTTC TCCTCTCCTT TTTCTCTTCA CATCTCCCCC 900
ATAGCACCCT GCCCTCATGG GACCTGCCCT CCCTCAGCCG TCAGCCATCA GCCATGGCCC 960
TCCCAGTGCC TCCTAGCCCC TTCTTCCAAG GAGCAGAGAG GTGGCCACCG GGGGTGGCTC 1020
TGTCCTACCT CCACTCTCTG CCCCTAAAGA TGGGAGGAGA CCAGCGGTCC ATGGGTCTGG 1080
CCTGTGAGTC TCCCCTTGCA GCCTGGTCAC TAGGCATCAC CCCCGCTTTG GTTCTTCAGA 1140
TGCTCTTGGG GTTCATAGGG GCAGGTCCTA GTCGGGCAGG GCCCCTGACC CTCCCGGCCT 1200
GGCTTCACTC TCCCTGACGG CTGCCATTGG TCCACCCTTT CATAGAGAGG CCTGCTTTGT 1260
TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 1320
TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCCCTGG AAGAGTCCTC 1380
CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACCGCCTCCC 1440
CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 1500
GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 1560
CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 1620
CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 1680
GCCCTG 1686



(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1843 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Contig 2535)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60
TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120
CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180
AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240
CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300
AAGAAGAAGC TGAAATACCT GCCCTACAAT CACCAGCACG AATACTTCTT CCTGATTGGG 360
CCGCCGCTGC TCATCCCCAT GTATTTCCAG TACCAGATCA TCATGACCAT GATCGTCCAT 420
AAGAACTGGG TGGACCTGGC CTGGGCCGTC AGCTACTACA TCCGGTTCTT CATCACCTAC 480
ATCCCTTTCT ACGGCATCCT GGGAGCCCTC CTTTTCCTCA ACTTCATCAG GTTCCTGGAG 540
AGCCACTGGT TTGTGTGGGT CACACAGATG AATCACATCG TCATGGAGAT TGACCAGGAG 600
GCCTACCGTG ACTGGTTCAG TAGCCAGCTG ACAGCCACCT GCAACGTGGA GCAGTCCTTC 660
TTCAACGACT GGTTCAGTGG ACACCTTAAC TTCCAGATTG AGCACCACCT CTTCCCCACC 720
ATGCCCCGGC ACAACTTACA CAAGATCGCC CCGCTGGTGA AGTCTCTATG TGCCAAGCAT 780
GGCATTGAAT ACCAGGAGAA GCCGCTACTG AGGGCCCTGC TGGACATCAT CAGGTCCCTG 840
AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 900
GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 960
TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 1020
CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 1080
CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 1140
TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 1200
CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 1260
TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 1320
GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 1380
CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 1440
GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 1500
ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 1560
ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 1620
ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 1680
GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 1740
TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 1800
CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 1843

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60
GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120
CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180
GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240
CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300
CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360
CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420
TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGCAG 480
GCCCAAGCTG GATGGCTGCA ACATGATTAT GGCCACCTGT CTGTCTACAG AAAACCCAAG 540
TGGAACCACC TTGTCCACAA ATTCGTCATT GGCCACTTAA AGGGTGCCTC TGCCAACTGG 600
TGGAATCATC GCCACTTCCA GCACCACGCC AAGCCTAACA TCTTCCACAA GGATCCCGAT 660
GTGAACATGC TGCACGTGTT TGTTCTGGGC GAATGGCAGC CCATCGAGTA CGGCAAGAAG 720
AAGCTGAAAT ACCTGCCCTA CAATCACCAG CACGAATACT TCTTCCTGAT TGGGCCGCCG 780
CTGCTCATCC CCATGTATTT CCAGTACCAG ATCATCATGA CCATGATCGT CCATAAGAAC 840
TGGGTGGACC TGGCCTGGGC CGTCAGCTAC TACATCCGGT TCTTCATCAC CTACATCCCT 900
TTCTACGGCA TCCTGGGAGC CCTCCTTTTC CTCAACTTCA TCAGGTTCCT GGAGAGCCAC 960
TGGTTTGTGT GGGTCACACA GATGAATCAC ATCGTCATGG AGATTGACCA GGAGGCCTAC 1020
CGTGACTGGT TCAGTAGCCA GCTGACAGCC ACCTGCAACG TGGAGCAGTC CTTCTTCAAC 1080
GACTGGTTCA GTGGACACCT TAACTTCCAG ATTGAGCACC ACCTCTTCCC CACCATGCCC 1140
CGGCACAACT TACACAAGAT CGCCCCGCTG GTGAAGTCTC TATGTGCCAA GCATGGCATT 1200
GAATACCAGG AGAAGCCGCT ACTGAGGGCC CTGCTGGACA TCATCAGGTC CCTGAAGAAG 1260
TCTGGGAAGC TGTGGCTGGA CGCCTACCTT CACAAATGAA GCCACAGCCC CCGGGACACC 1320
GTGGGGAAGG GGTGCAGGTG GGGTGATGGC CAGAGGAATG ATGGGCTTTT GTTCTGAGGG 1380
GTGTCCGAGA GGCTGGTGTA TGCACTGCTC ACGGACCCCA TGTTGGATCT TTCTCCCTTT 1440
CTCCTCTCCT TTTTCTCTTC ACATCTCCCC CATAGCACCC TGCCCTCATG GGACCTGCCC 1500
TCCCTCAGCC GTCAGCCATC AGCCATGGCC CTCCCAGTGC CTCCTAGCCC CTTCTTCCAA 1560
GGAGCAGAGA GGTGGCCACC GGGGGTGGCT CTGTCCTACC TCCACTCTCT GCCCCTAAAG 1620
ATGGGAGGAG ACCAGCGGTC CATGGGTCTG GCCTGTGAGT CTCCCCTTGC AGCCTGGTCA 1680
CTAGGCATCA CCCCCGCTTT GGTTCTTCAG ATGCTCTTGG GGTTCATAGG GGCAGGTCCT 1740
AGTCGGGCAG GGCCCCTGAC CCTCCCGGCC TGGCTTCACT CTCCCTGACG GCTGCCATTG 1800
GTCCACCCTT TCATAGAGAG GCCTGCTTTG TTACAAAGCT CGGGTCTCCC TCCTGCAGCT 1860
CGGTTAAGTA CCCGAGGCCT CTCTTAAGAT GTCCAGGGCC CCAGGCCCGC GGGCACAGCC 1920
AGCCCAAACC TTGGGCCCTG GAAGAGTCCT CCACCCCATC ACTAGAGTGC TCTGACCCTG 1980
GGCTTTCACG GGCCCCATTC CACCGCCTCC CCAACTTGAG CCTGTGACCT TGGGACCAAA 2040
GGGGGAGTCC CTCGTCTCTT GTGACTCAGC AGAGGCAGTG GCCACGTTCA GGGAGGGGCC 2100
GGCTGGCCTG GAGGCTCAGC CCACCCTCCA GCTTTTCCTC AGGGTGTCCT GAGGTCCAAG 2160
ATTCTGGAGC AATCTGACCC TTCTCCAAAG GCTCTGTTAT CAGCTGGGCA GTGCCAGCCA 2220
ATCCCTGGCC ATTTGGCCCC AGGGGACGTG GGCCCTG 2257



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:

His Ala Asp Arg Arg Arg Glu Ile Leu Ala Lys Tyr Pro Glu Ile
1               5                   10                  15
Lys Ser Leu Met Lys Pro Asp Pro Asn Leu Ile Trp Ile Ile Ile
                20                  25                  30
Met Met Val Leu Thr Gln Leu Gly Ala Phe Tyr Ile Val Lys Asp
                35                  40                  45
Leu Asp Trp Lys Trp Val Ile Phe Gly Ala Tyr Ala Phe Gly Ser
                50                  55                  60
Cys Ile Asn His Ser Met Thr Leu Ala Ile His Glu Ile Ala His
                65                  70                  75
Asn Ala Ala Phe Gly Asn Cys Lys Ala Met Trp Asn Arg Trp Phe
                80                  85                  90
Gly Met Phe Ala Asn Leu Pro Ile Gly Ile Pro Tyr Ser Ile Ser
                95                  100                 105
Phe Lys Arg Tyr His Met Asp His His Arg Tyr Leu Gly Ala Asp
                110                 115                 120
Gly Val Asp Val Asp Ile Pro Thr Asp Phe Glu Gly Trp Phe Phe
                125                 130                 135
Cys Thr Ala Phe Arg Lys Phe Ile Trp Val Ile Leu Gln Pro Leu
                140                 145                 150
Phe Tyr Ala Phe Arg Pro Leu Phe Ile Asn Pro Lys Pro Ile Thr
                155                 160                 165
Tyr Leu Glu Val Ile Asn Thr Val Ala Gln Val Thr Phe Asp Ile
                170                 175                 180
Leu Ile Tyr Tyr Phe Leu Gly Ile Lys Ser Leu Val Tyr Met Leu
                185                 190                 195
Ala Ala Ser Leu Leu Gly Leu Gly Leu His Pro Ile Ser Gly His
                200                 205                 210
Phe Ile Ala Glu His Tyr Met Phe Leu Lys Gly His Glu Thr Tyr
                215                 220                 225
Ser Tyr Tyr Gly Pro Leu Asn Leu Leu Thr Phe Asn Val Gly Tyr
                230                 235                 240
His Asn Glu His His Asp Phe Pro Asn Ile Pro Gly Lys Ser Leu
                245                 250                 255
Pro Leu Val Arg Lys Ile Ala Ala Glu Tyr Tyr Asp Asn Leu Pro
                260                 265                 270
His Tyr Asn Ser Trp Ile Lys Val Leu Tyr Asp Phe Val Met Asp
                275                 280                 285
Asp Thr Ile Ser Pro Tyr Ser Arg Met Lys Arg His Gln Lys Gly
                290                 295                 300
Glu Met Val Leu Glu *** Ile Ser Leu Val Pro Lys Gly Phe Phe
                305                 310                 315
Ser Lys Thr Leu Asp Asp Lys Met Glu Phe Leu His Tyr *** Thr
                320                 325                 330
*** Asp Gln *** Cys Ser Glu Ala Pro Leu Ala Gln Phe Gln Ser
                335                 340                 345
Lys Ser Ser Val Ile Pro Arg Ser Glu Ser Gly Phe *** Thr Val
                350                 355                 360
Ser Leu Thr Leu Tyr Cys Ser Val Ser Leu Thr Gly Asn Leu ***
                365                 370                 375
Leu Val Tyr Tyr Arg His *** Gly Cys Phe Thr His Val Cys His
                380                 385                 390
Phe Ile Ser Ile Ser Phe Lys Lys Leu Leu Lys Ser Tyr Phe Ala
                400                 405                 410
Arg

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Tyr Leu Leu Arg Pro Leu Leu Pro His Leu Cys Ala Thr Ile Gly
1               5                   10                  15
Ala Glu Ser Phe Leu Gly Leu Phe Phe Ile Val Arg Phe Leu Glu
                20                  25                  30
Ser Asn Trp Phe Val Trp Val Thr Gln Met Asn His Ile Pro Met
                35                  40                  45
His Ile Asp His Asp Arg Asn Met Asp Trp Val Ser Thr Gln Leu
                50                  55                  60
Gln Ala Thr Cys Asn Val His Lys Ser Ala Phe Asn Asp Trp Phe
                65                  70                  75
Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
                80                  85                  90
Met Pro Arg His Asn Tyr His Lys Val Ala Pro Leu Val Gln Ser
                95                  100                 105
Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser Lys Pro Leu Leu
                110                 115                 120
Ser Ala Phe Ala Asp Ile Ile His Ser Leu Lys Glu Ser Gly Gln
                125                 130                 135
Leu Trp Leu Asp Ala Tyr Leu His Gln *** Gln Gln Pro Pro Cys
                140                 145                 150
Pro Val Trp Lys Lys Arg Arg Lys Thr Leu Glu Pro Arg Gln Arg
                155                 160                 165
Gly Ala *** Gly Thr Met Pro Leu *** Phe Asn Thr Gln Arg Gly
                170                 175                 180
Leu Gly Leu Gly Thr *** Ser Leu *** Leu Lys Leu Leu Pro Phe
                185                 190                 195
Ile Phe *** Pro Gln Phe *** Asp Pro Lys Trp Gly Val Asp Thr
                200                 205                 210
Glu Val Pro Arg Arg Glu Gly Ala
                215

















(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 



Val Phe Tyr Phe Gly Asn Gly Trp Ile Pro Thr Leu Ile Thr Ala
1               5                   10                  15
Phe Val Leu Ala Thr Ser Gln Ala Gln Ala Gly Trp Leu Gln His
                20                  25                  30
Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His
                35                  40                  45
Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly Ala Ser Ala
                50                  55                  60
Asn Trp Trp Asn His Arg His Phe Gln His His Ala Lys Pro Asn
                65                  70                  75
Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Xxx
                80                  85











(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 306 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln
1               5                   10                  15
Arg Ser Gly Cys Glu Glu Arg Trp Leu Val Ile Asp Arg Lys Val
                20                  25                  30
Tyr Asn Ile Ser Glu Phe Thr Arg Arg His Pro Gly Gly Ser Arg
                35                  40                  45
Val Ile Ser His Tyr Ala Gly Gln Asp Ala Thr Asp Pro Phe Val
                50                  55                  60
Ala Phe His Ile Asn Lys Gly Leu Val Lys Lys Tyr Met Asn Ser
                65                  70                  75
Leu Leu Ile Gly Glu Leu Ser Pro Glu Gln Pro Ser Phe Glu Pro
                80                  85                  90
Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe Arg Glu Leu Arg Ala
                95                  100                 105
Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn His Val Phe Phe
                110                 115                 120
Leu Leu Tyr Leu Leu His Ile Leu Leu Leu Asp Gly Ala Ala Trp
                125                 130                 135
Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe Leu Leu
                140                 145                 150
Cys Ala Val Leu Leu Ser Ala Val Gln Ala Gln Ala Gly Trp Leu
                155                 160                 165
Gln His Asp Phe Gly His Leu Ser Val Phe Ser Thr Ser Lys Trp
                170                 175                 180
Asn His Leu Leu His His Phe Val Ile Gly His Leu Lys Gly Ala
                185                 190                 195
Pro Ala Ser Trp Trp Asn His Met His Phe Gln His His Ala Lys
                200                 205                 210
Pro Asn Cys Phe Arg Lys Asp Pro Asp Ile Asn Met His Pro Phe
                215                 220                 225
Phe Phe Ala Leu Gly Lys Ile Leu Ser Val Glu Leu Gly Lys Gln
                230                 235                 240
Lys Lys Lys Tyr Met Pro Tyr Asn His Gln His Xxx Tyr Phe Phe
                245                 250                 255
Leu Ile Gly Pro Pro Ala Leu Leu Pro Leu Tyr Phe Gln Trp Tyr
                260                 265                 270
Ile Phe Tyr Phe Val Ile Gln Arg Lys Lys Trp Val Asp Leu Ala
                275                 280                 285
Trp Ile Ser Lys Gln Glu Tyr Asp Glu Ala Gly Leu Pro Leu Ser
                290                 295                 300
Thr Ala Asn Ala Ser Lys
                305



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

His Leu Lys Gly Ala Ser Ala Asn Trp Trp Asn His Arg His Phe
1               5                   10                  15
Gln His His Ala Lys Pro Asn Ile Phe His Lys Asp Pro Asp Val
                20                  25                  30
Asn Met Leu His Val Phe Val Leu Gly Glu Trp Gln Pro Ile Glu
                35                  40                  45
Tyr Gly Lys Lys Lys Leu Lys Tyr Leu Pro Tyr Asn His Gln His
                50                  55                  60
Glu Tyr Phe Phe Leu Ile Gly Pro Pro Leu Leu Ile Pro Met Tyr
                65                  70                  75
Phe Gln Tyr Gln Ile Ile Met Thr Met Ile Val His Lys Asn Trp
                80                  85                  90
Val Asp Leu Ala Trp Ala Val Ser Tyr Tyr Ile Arg Phe Phe Ile
                95                  100                 105
Thr Tyr Ile Pro Phe Tyr Gly Ile Leu Gly Ala Leu Leu Phe Leu
                110                 115                 120
Asn Phe Ile Arg Phe Leu Glu Ser His Trp Phe Val Trp Val Thr
                125                 130                 135
Gln Met Asn His Ile Val Met Glu Ile Asp Gln Glu Ala Tyr Arg
                140                 145                 150



WO 98/46763

PCT/US98/07126



Asp Trp Phe Ser Ser Gln Leu Thr Ala Thr Cys Asn Val Glu Gln
                155                 160                 165
Ser Phe Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile
                170                 175                 180
Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu His Lys
                185                 190                 195
Ile Ala Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly Ile Glu
                200                 205                 210
Tyr Gln Glu Lys Pro Leu Leu Arg Ala Leu Leu Asp Ile Ile Arg
                215                 220                 225
Ser Leu Lys Lys Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His
                230                 235                 240
Lys *** Ser His Ser Pro Arg Asp Thr Val Gly Lys Gly Cys Arg
                245                 250                 255
Trp Gly Asp Gly Gln Arg Asn Asp Gly Leu Leu Phe *** Gly Val
                260                 265                 270
Ser Glu Arg Leu Val Tyr Ala Leu Leu Thr Asp Pro Met Leu Asp
                275                 280                 285
Leu Ser Pro Phe Leu Leu Ser Phe Phe Ser Ser His Leu Pro His
                290                 295                 300
Ser Thr Leu Pro Ser Trp Asp Leu Pro Ser Leu Ser Arg Gln Pro
                305                 310                 315
Ser Ala Met Ala Leu Pro Val Pro Pro Ser Pro Phe Phe Gln Gly
                320                 325                 330
Ala Glu Arg Trp Pro Pro Gly Val Ala Leu Ser Tyr Leu His Ser
                335                 340                 345
Leu Pro Leu Lys Met Gly Gly Asp Gln Arg Ser Met Gly Leu Ala
                350                 355                 360
Cys Glu Ser Pro Leu Ala Ala Trp Ser Leu Gly Ile Thr Pro Ala
                365                 370                 375
Leu Val Leu Gln Met Leu Leu Gly Phe Ile Gly Ala Gly Pro Ser
                380                 385                 390
Arg Ala Gly Pro Leu Thr Leu Pro Ala Trp Leu His Ser Pro ***
                400                 405                 410
Arg Leu Pro Leu Val His Pro Phe Ile Glu Arg Pro Ala Leu Leu
                415                 420                 425
Gln Ser Ser Gly Leu Pro Pro Ala Ala Arg Leu Ser Thr Arg Gly
                430                 435                 440
Leu Ser *** Asp Val Gln Gly Pro Arg Pro Ala Gly Thr Ala Ser
                445                 450                 455
Pro Asn Leu Gly Pro Trp Lys Ser Pro Pro Pro His His *** Ser
                460                 465                 470
Ala Leu Thr Leu Gly Phe His Gly Pro His Ser Thr Ala Ser Pro
                475                 480                 485
Thr *** Ala Cys Asp Leu Gly Thr Lys Gly Gly Val Pro Arg Leu
                490                 495                 500
Leu *** Leu Ser Arg Gly Ser Gly His Val Gln Gly Gly Ala Gly
                505                 510                 515
Trp Pro Gly Gly Ser Ala His Pro Pro Ala Phe Pro Gln Gly Val
                520                 525                 530
Leu Arg Ser Lys Ile Leu Glu Gln Ser Asp Pro Ser Pro Lys Ala
                535                 540                 545
Leu Leu Ser Ala Gly Gln Cys Gln Pro Ile Pro Gly His Leu Ala
                550                 555                 560
Pro Gly Asp Val Gly Pro Xxx
                565



(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 619 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:



Val Phe Tyr Phe Gly Asn Gly Trp Ile Pro Thr Leu Ile Thr Ala
1 5 10 15

Phe Val Leu Ala Thr Ser Gln Ala Gln Ala Gly Trp Leu Gln His
20 25 30

Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His
35 40 45

Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly Ala Ser Ala
50 55 60

Asn Trp Trp Asn His Arg His Phe Gln His His Ala Lys Pro Asn
65 70 75

Ile Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val
80 85 90

Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Lys Lys Leu Lys
95 100 105

Tyr Leu Pro Tyr Asn His Gln His Glu Tyr Phe Phe Leu Ile Gly
110 115 120

Pro Pro Leu Leu Ile Pro Met Tyr Phe Gln Tyr Gln Ile Ile Met
125 130 135

Thr Met Ile Val His Lys Asn Trp Val Asp Leu Ala Trp Ala Val
140 145 150

Ser Tyr Tyr Ile Arg Phe Phe Ile Thr Tyr Ile Pro Phe Tyr Gly
155 160 165

Ile Leu Gly Ala Leu Leu Phe Leu Asn Phe Ile Arg Phe Leu Glu
170 175 180

Ser His Trp Phe Val Trp Val Thr Gln Met Asn His Ile Val Met
185 190 195

Glu Ile Asp Gln Glu Ala Tyr Arg Asp Trp Phe Ser Ser Gln Leu
200 205 210

Thr Ala Thr Cys Asn Val Glu Gln Ser Phe Phe Asn Asp Trp Phe
215 220 225

Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
230 235 240

Met Pro Arg His Asn Leu His Lys Ile Ala Pro Leu Val Lys Ser
245 250 255

Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Glu Lys Pro Leu Leu
260 265 270

Arg Ala Leu Leu Asp Ile Ile Arg Ser Leu Lys Lys Ser Gly Lys
275 280 285

Leu Trp Leu Asp Ala Tyr Leu His Lys *** Ser His Ser Pro Arg
290 295 300

Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly Gln Arg Asn
305 310 315

Asp Gly Leu Leu Phe *** Gly Val Ser Glu Arg Leu Val Tyr Ala
320 325 330

Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe Leu Leu Ser
335 340 345

Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro Ser Trp Asp
350 355 360

Leu Pro Ser Leu Ser Arg Gln Pro Ser Ala Met Ala Leu Pro Val
365 370 375

Pro Pro Ser Pro Phe Phe Gln Gly Ala Glu Arg Trp Pro Pro Gly
380 385 390

Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys Met Gly Gly
400 405 410

Asp Gln Arg Ser Met Gly Leu Ala Cys Glu Ser Pro Leu Ala Ala
415 420 425

Trp Ser Leu Gly Ile Thr Pro Ala Leu Val Leu Gln Met Leu Leu
430 435 440

Gly Phe Ile Gly Ala Gly Pro Ser Arg Ala Gly Pro Leu Thr Leu
445 450 455

Pro Ala Trp Leu His Ser Pro *** Arg Leu Pro Leu Val His Pro
460 465 470

Phe Ile Glu Arg Pro Ala Leu Leu Gln Ser Ser Gly Leu Pro Pro
475 480 485

Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser *** Asp Val Gln Gly
490 495 500

Pro Arg Pro Ala Gly Thr Ala Ser Pro Asn Leu Gly Pro Trp Lys
505 510 515

Ser Pro Pro Pro His His *** Ser Ala Leu Thr Leu Gly Phe His
520 525 530

Gly Pro His Ser Thr Ala Ser Pro Thr *** Ala Cys Asp Leu Gly
535 540 545

Thr Lys Gly Gly Val Pro Arg Leu Leu *** Leu Ser Arg Gly Ser
550 555 560

Gly His Val Gln Gly Gly Ala Gly Trp Pro Gly Gly Ser Ala His
565 570 575

Pro Pro Ala Phe Pro Gln Gly Val Leu Arg Ser Lys Ile Leu Glu
580 585 590

Gln Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala Gly Gln Cys
595 600 605

Gln Pro Ile Pro Gly His Leu Ala Pro Gly Asp Val Gly Pro Xxx
610 615 620



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln
1 5 10 15

Arg Ser Gly Cys Glu Glu Arg Trp Leu Val Ile Asp Arg Lys Val
20 25 30

Tyr Asn Ile Ser Glu Phe Thr Arg Arg His Pro Gly Gly Ser Arg
35 40 45

Val Ile Ser His Tyr Ala Gly Gln Asp Ala Thr Asp Pro Phe Val
50 55 60

Ala Phe His Ile Asn Lys Gly Leu Val Lys Lys Tyr Met Asn Ser
65 70 75

Leu Leu Ile Gly Glu Leu Ser Pro Glu Gln Pro Ser Phe Glu Pro
80 85 90

Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe Arg Glu Leu Arg Ala
95 100 105

Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn His Val Phe Phe
110 115 120

Leu Leu Tyr Leu Leu His Ile Leu Leu Leu Asp Gly Ala Ala Trp
125 130 135

Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe Leu Leu
140 145 150

Cys Ala Val Leu Leu Ser Ala Val Gln Gln Ala Gln Ala Gly Trp
155 160 165

Leu Gln His Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys
170 175 180

Trp Asn His Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly
185 190 195

Ala Ser Ala Asn Trp Trp Asn His Arg His Phe Gln His His Ala
200 205 210

Lys Pro Asn Ile Phe His Lys Asp Pro Asp Val Asn Met Leu His
215 220 225

Val Phe Val Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Lys
230 235 240

Lys Leu Lys Tyr Leu Pro Tyr Asn His Gln His Glu Tyr Phe Phe
245 250 255

Leu Ile Gly Pro Pro Leu Leu Ile Pro Met Tyr Phe Gln Tyr Gln
260 265 270

Ile Ile Met Thr Met Ile Val His Lys Asn Trp Val Asp Leu Ala
275 280 285

Trp Ala Val Ser Tyr Tyr Ile Arg Phe Phe Ile Thr Tyr Ile Pro
290 295 300

Phe Tyr Gly Ile Leu Gly Ala Leu Leu Phe Leu Asn Phe Ile Arg
305 310 315

Phe Leu Glu Ser His Trp Phe Val Trp Val Thr Gln Met Asn His
320 325 330

Ile Val Met Glu Ile Asp Gln Glu Ala Tyr Arg Asp Trp Phe Ser
335 340 345

Ser Gln Leu Thr Ala Thr Cys Asn Val Glu Gln Ser Phe Phe Asn
350 355 360

Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu
365 370 375

Phe Pro Thr Met Pro Arg His Asn Leu His Lys Ile Ala Pro Leu
380 385 390

Val Lys Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Glu Lys
400 405 410

Pro Leu Leu Arg Ala Leu Leu Asp Ile Ile Arg Ser Leu Lys Lys
415 420 425

Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His Lys *** Ser His
430 435 440

Ser Pro Arg Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly
445 450 455

Gln Arg Asn Asp Gly Leu Leu Phe *** Gly Val Ser Glu Arg Leu
460 465 470

Val Tyr Ala Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe
475 480 485

Leu Leu Ser Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro
490 495 500

Ser Trp Asp Leu Pro Ser Leu Ser Arg Gln Pro Ser Ala Met Ala
505 510 515

Leu Pro Val Pro Pro Ser Pro Phe Phe Gln Gly Ala Glu Arg Trp
520 525 530

Pro Pro Gly Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys
535 540 545

Met Gly Gly Asp Gln Arg Ser Met Gly Leu Ala Cys Glu Ser Pro
550 555 560

Leu Ala Ala Trp Ser Leu Gly Ile Thr Pro Ala Leu Val Leu Gln
565 570 575

Met Leu Leu Gly Phe Ile Gly Ala Gly Pro Ser Arg Ala Gly Pro
580 585 590

Leu Thr Leu Pro Ala Trp Leu His Ser Pro *** Arg Leu Pro Leu
595 600 605

Val His Pro Phe Ile Glu Arg Pro Ala Leu Leu Gln Ser Ser Gly
610 615 620

Leu Pro Pro Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser *** Asp
625 630 635

Val Gln Gly Pro Arg Pro Ala Gly Thr Ala Ser Pro Asn Leu Gly
640 645 650

Pro Trp Lys Ser Pro Pro Pro His His *** Ser Ala Leu Thr Leu
655 660 665

Gly Phe His Gly Pro His Ser Thr Ala Ser Pro Thr *** Ala Cys
670 675 680

Asp Leu Gly Thr Lys Gly Gly Val Pro Arg Leu Leu *** Leu Ser
685 690 695

Arg Gly Ser Gly His Val Gln Gly Gly Ala Gly Trp Pro Gly Gly
700 705 710

Ser Ala His Pro Pro Ala Phe Pro Gln Gly Val Leu Arg Ser Lys
715 720 725

Ile Leu Glu Gln Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala
730 735 740

Gly Gln Cys Gln Pro Ile Pro Gly His Leu Ala Pro Gly Asp Val
745 750 755

Gly Pro Xxx
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: KNUTZON, DEBORAH
MUKERJI, PRADIP
HUANG, YUNG-SHENG
THURMOND, JENNIFER
CHAUDHARY, SUNITA
LEONARD, AMANDA

(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR SYNTHESIS
OF LONG CHAIN POLY-UNSATURATED FATTY ACIDS

(iii) NUMBER OF SEQUENCES: 40

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: LIMBACH AND LIMBACH LLP

(B) STREET: 2001 FERRY BUILDING

(C) CITY: SAN FRANCISCO

(D) STATE: CA

(E) COUNTRY: USA

(F) ZIP: 94111

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: Microsoft Word

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: WARD, MICHAEL R.

(B) REGISTRATION NUMBER: 38,651

(C) REFERENCE/DOCKET NUMBER: CGAB-210

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: (415) 433-4150

(B) TELEFAX: (415) 433-8716

(C) TELEX: N/A

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1617 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CGACACTCCT TCCTTCTTCT CACCCGTCCT AGTCCCCTTC AACCCCCCTC TTTGACAAAG 60

ACAACAAACC ATGGCTGCTG CTCCCAGTGT GAGGACGTTT ACTCGGGCCG AGGTTTTGAA 120

TGCCGAGGCT CTGAATGAGG GCAAGAAGGA TGCCGAGGCA CCCTTCTTGA TGATCATCGA 180

CAACAAGGTG TACGATGTCC GCGAGTTCGT CCCTGATCAT CCCGGTGGAA GTGTGATTCT 240

CACGCACGTT GGCAAGGACG GCACTGACGT CTTTGACACT TTTCACCCCG AGGCTGCTTG 300

GGAGACTCTT GCCAACTTTT ACGTTGGTGA TATTGACGAG AGCGACCGCG ATATCAAGAA 360

TGATGACTTT GCGGCCGAGG TCCGCAAGCT GCGTACCTTG TTCCAGTCTC TTGGTTACTA 420

CGATTCTTCC AAGGCATACT ACGCCTTCAA GGTCTCGTTC AACCTCTGCA TCTGGGGTTT 480

GTCGACGGTC ATTGTGGCCA AGTGGGGCCA GACCTCGACC CTCGCCAACG TGCTCTCGGC 540

TGCGCTTTTG GGTCTGTTCT GGCAGCAGTG CGGATGGTTG GCTCACGACT TTTTGCATCA 600

CCAGGTCTTC CAGGACCGTT TCTGGGGTGA TCTTTTCGGC GCCTTCTTGG GAGGTGTCTG 660

CCAGGGCTTC TCGTCCTCGT GGTGGAAGGA CAAGCACAAC ACTCACCACG CCGCCCCCAA 720

CGTCCACGGC GAGGATCCCG ACATTGACAC CCACCCTCTG TTGACCTGGA GTGAGCATGC 780

GTTGGAGATG TTCTCGGATG TCCCAGATGA GGAGCTGACC CGCATGTGGT CGCGTTTCAT 840

GGTCCTGAAC CAGACCTGGT TTTACTTCCC CATTCTCTCG TTTGCCCGTC TCTCCTGGTG 900

CCTCCAGTCC ATTCTCTTTG TGCTGCCTAA CGGTCAGGCC CACAAGCCCT CGGGCGCGCG 960

TGTGCCCATC TCGTTGGTCG AGCAGCTGTC GCTTGCGATG CACTGGACCT GGTACCTCGC 1020

CACCATGTTC CTGTTCATCA AGGATCCCGT CAACATGCTG GTGTACTTTT TGGTGTCGCA 1080

GGCGGTGTGC GGAAACTTGT TGGCGATCGT GTTCTCGCTC AACCACAACG GTATGCCTGT 1140

GATCTCGAAG GAGGAGGCGG TCGATATGGA TTTCTTCACG AAGCAGATCA TCACGGGTCG 1200

TGATGTCCAC CCGGGTCTAT TTGCCAACTG GTTCACGGGT GGATTGAACT ATCAGATCGA 1260

GCACCACTTG TTCCCTTCGA TGCCTCGCCA CAACTTTTCA AAGATCCAGC CTGCTGTCGA 1320

GACCCTGTGC AAAAAGTACA ATGTCCGATA CCACACCACC GGTATGATCG AGGGAACTGC 1380

AGAGGTCTTT AGCCGTCTGA ACGAGGTCTC CAAGGCTGCC TCCAAGATGG GTAAGGCGCA 1440

GTAAAAAAAA AAACAAGGAC GTTTTTTTTC GCCAGTGCCT GTGCCTGTGC CTGCTTCCCT 1500

TGTCAAGTCG AGCGTTTCTG GAAAGGATCG TTCAGTGCAG TATCATCATT CTCCTTTTAC 1560

CCCCCGCTCA TATCTCATTC ATTTCTCTTA TTAAACAACT TGTTCCCCCC TTCACCG 1617

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 457 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu
1 5 10 15

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe
20 25 30

Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro
35 40 45

Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly
50 55 60

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu
65 70 75 80

Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys
85 90 95

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln
100 105 110

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val
115 120 125

Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys
130 135 140

Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu
145 150 155 160

Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His
165 170 175

His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe
180 185 190

Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys
195 200 205

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp
210 215 220

Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met
225 230 235 240

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe
245 250 255

Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala
260 265 270

Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly
275 280 285

Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu
290 295 300

Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe
305 310 315 320

Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser
325 330 335

Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His
340 345 350

Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe
355 360 365

Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe
370 375 380

Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
385 390 395 400

Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val
405 410 415

Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met
420 425 430

Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys
435 440 445

Ala Ala Ser Lys Met Gly Lys Ala Gln
450 455

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1488 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTCCCCTGTC GCTGTCGGCA CACCCCATCC TCCCTCGCTC CCTCTGCGTT TGTCCTTGGC 60

CCACCGTCTC TCCTCCACCC TCCGAGACGA CTGCAACTGT AATCAGGAAC CGACAAATAC 120

ACGATTTCTT TTTACTCAGC ACCAACTCAA AATCCTCAAC CGCAACCCTT TTTCAGGATG 180

GCACCTCCCA ACACTATCGA TGCCGGTTTG ACCCAGCGTC ATATCAGCAC CTCGGCCCCA 240

AACTCGGCCA AGCCTGCCTT CGAGCGCAAC TACCAGCTCC CCGAGTTCAC CATCAAGGAG 300

ATCCGAGAGT GCATCCCTGC CCACTGCTTT GAGCGCTCCG GTCTCCGTGG TCTCTGCCAC 360

GTTGCCATCG ATCTGACTTG GGCGTCGCTC TTGTTCCTGG CTGCGACCCA GATCGACAAG 420

TTTGAGAATC CCTTGATCCG CTATTTGGCC TGGCCTGTTT ACTGGATCAT GCAGGGTATT 480

GTCTGCACCG GTGTCTGGGT GCTGGCTCAC GAGTGTGGTC ATCAGTCCTT CTCGACCTCC 540

AAGACCCTCA ACAACACAGT TGGTTGGATC TTGCACTCGA TGCTCTTGGT CCCCTACCAC 600

TCCTGGAGAA TCTCGCACTC GAAGCACCAC AAGGCCACTG GCCATATGAC CAAGGACCAG 660

GTCTTTGTGC CCAAGACCCG CTCCCAGGTT GGCTTGCCTC CCAAGGAGAA CGCTGCTGCT 720

GCCGTTCAGG AGGAGGACAT GTCCGTGCAC CTGGATGAGG AGGCTCCCAT TGTGACTTTG 780

TTCTGGATGG TGATCCAGTT CTTGTTCGGA TGGCCCGCGT ACCTGATTAT GAACGCCTCT 840

GGCCAAGACT ACGGCCGCTG GACCTCGCAC TTCCACACGT ACTCGCCCAT CTTTGAGCCC 900

CGCAACTTTT TCGACATTAT TATCTCGGAC CTCGGTGTGT TGGCTGCCCT CGGTGCCCTG 960

ATCTATGCCT CCATGCAGTT GTCGCTCTTG ACCGTCACCA AGTACTATAT TGTCCCCTAC 1020

CTCTTTGTCA ACTTTTGGTT GGTCCTGATC ACCTTCTTGC AGCACACCGA TCCCAAGCTG 1080

CCCCATTACC GCGAGGGTGC CTGGAATTTC CAGCGTGGAG CTCTTTGCAC CGTTGACCGC 1140

TCGTTTGGCA AGTTCTTGGA CCATATGTTC CACGGCATTG TCCACACCCA TGTGGCCCAT 1200

CACTTGTTCT CGCAAATGCC GTTCTACCAT GCTGAGGAAG CTACCTATCA TCTCAAGAAA 1260

CTGCTGGGAG AGTACTATGT GTACGACCCA TCCCCGATCG TCGTTGCGGT CTGGAGGTCG 1320

TTCCGTGAGT GCCGATTCGT GGAGGATCAG GGAGACGTGG TCTTTTTCAA GAAGTAAAAA 1380

AAAAGACAAT GGACCACACA CAACCTTGTC TCTACAGACC TACGTATCAT GTAGCCATAC 1440

CACTTCATAA AAGAACATGA GCTCTAGAGG CGTGTCATTC GCGCCTCC 1488

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ala Pro Pro Asn Thr Ile Asp Ala Gly Leu Thr Gln Arg His Ile
1 5 10 15

Ser Thr Ser Ala Pro Asn Ser Ala Lys Pro Ala Phe Glu Arg Asn Tyr
20 25 30

Gln Leu Pro Glu Phe Thr Ile Lys Glu Ile Arg Glu Cys Ile Pro Ala
35 40 45

His Cys Phe Glu Arg Ser Gly Leu Arg Gly Leu Cys His Val Ala Ile
50 55 60

Asp Leu Thr Trp Ala Ser Leu Leu Phe Leu Ala Ala Thr Gln Ile Asp
65 70 75 80

Lys Phe Glu Asn Pro Leu Ile Arg Tyr Leu Ala Trp Pro Val Tyr Trp
85 90 95

Ile Met Gln Gly Ile Val Cys Thr Gly Val Trp Val Leu Ala His Glu
100 105 110

Cys Gly His Gln Ser Phe Ser Thr Ser Lys Thr Leu Asn Asn Thr Val
115 120 125

Gly Trp Ile Leu His Ser Met Leu Leu Val Pro Tyr His Ser Trp Arg
130 135 140

Ile Ser His Ser Lys His His Lys Ala Thr Gly His Met Thr Lys Asp
145 150 155 160

Gln Val Phe Val Pro Lys Thr Arg Ser Gln Val Gly Leu Pro Pro Lys
165 170 175

Glu Asn Ala Ala Ala Ala Val Gln Glu Glu Asp Met Ser Val His Leu
180 185 190

Asp Glu Glu Ala Pro Ile Val Thr Leu Phe Trp Met Val Ile Gln Phe
195 200 205

Leu Phe Gly Trp Pro Ala Tyr Leu Ile Met Asn Ala Ser Gly Gln Asp
210 215 220

Tyr Gly Arg Trp Thr Ser His Phe His Thr Tyr Ser Pro Ile Phe Glu
225 230 235 240

Pro Arg Asn Phe Phe Asp Ile Ile Ile Ser Asp Leu Gly Val Leu Ala
245 250 255

Ala Leu Gly Ala Leu Ile Tyr Ala Ser Met Gln Leu Ser Leu Leu Thr
260 265 270

Val Thr Lys Tyr Tyr Ile Val Pro Tyr Leu Phe Val Asn Phe Trp Leu
275 280 285

Val Leu Ile Thr Phe Leu Gln His Thr Asp Pro Lys Leu Pro His Tyr
290 295 300

Arg Glu Gly Ala Trp Asn Phe Gln Arg Gly Ala Leu Cys Thr Val Asp
305 310 315 320

Arg Ser Phe Gly Lys Phe Leu Asp His Met Phe His Gly Ile Val His
325 330 335

Thr His Val Ala His His Leu Phe Ser Gln Met Pro Phe Tyr His Ala
340 345 350

Glu Glu Ala Thr Tyr His Leu Lys Lys Leu Leu Gly Glu Tyr Tyr Val
355 360 365

Tyr Asp Pro Ser Pro Ile Val Val Ala Val Trp Arg Ser Phe Arg Glu
370 375 380

Cys Arg Phe Val Glu Asp Gln Gly Asp Val Val Phe Phe Lys Lys
385 390 395

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 355 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Glu Val Arg Lys Leu Arg Thr Leu Phe Gln Ser Leu Gly Tyr Tyr Asp
1 5 10 15

Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val Ser Phe Asn Leu Cys Ile
20 25 30

Trp Gly Leu Ser Thr Val Ile Val Ala Lys Trp Gly Gln Thr Ser Thr
35 40 45

Leu Ala Asn Val Leu Ser Ala Ala Leu Leu Gly Leu Phe Trp Gln Gln
50 55 60

Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Gln Asp
65 70 75 80

Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe Leu Gly Gly Val Cys Gln
85 90 95

Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys His Asn Thr His His Ala
100 105 110

Ala Pro Asn Val His Gly Glu Asp Pro Asp Ile Asp Thr His Pro Leu
115 120 125

Leu Thr Trp Ser Glu His Ala Leu Glu Met Phe Ser Asp Val Pro Asp
130 135 140

Glu Glu Leu Thr Arg Met Trp Ser Arg Phe Met Val Leu Asn Gln Thr
145 150 155 160

Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala Arg Leu Ser Trp Cys Leu
165 170 175

Gln Ser Ile Leu Phe Val Leu Pro Asn Gly Gln Ala His Lys Pro Ser
180 185 190

Gly Ala Arg Val Pro Ile Ser Leu Val Glu Gln Leu Ser Leu Ala Met
195 200 205

His Trp Thr Trp Tyr Leu Ala Thr Met Phe Leu Phe Ile Lys Asp Pro
210 215 220

Val Asn Met Leu Val Tyr Phe Leu Val Ser Gln Ala Val Cys Gly Asn
225 230 235 240

Leu Leu Ala Ile Val Phe Ser Leu Asn His Asn Gly Met Pro Val Ile
245 250 255

Ser Lys Glu Glu Ala Val Asp Met Asp Phe Phe Thr Lys Gln Ile Ile
260 265 270

Thr Gly Arg Asp Val His Pro Gly Leu Phe Ala Asn Trp Phe Thr Gly
275 280 285

Gly Leu Asn Tyr Gln Ile Glu His His Leu Phe Pro Ser Met Pro Arg
290 295 300

His Asn Phe Ser Lys Ile Gln Pro Ala Val Glu Thr Leu Cys Lys Lys
305 310 315 320

Tyr Asn Val Arg Tyr His Thr Thr Gly Met Ile Glu Gly Thr Ala Glu
325 330 335

Val Phe Ser Arg Leu Asn Glu Val Ser Lys Ala Ala Ser Lys Met Gly
340 345 350

Lys Ala Gln
355

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Val Thr Leu Tyr Thr Leu Ala Phe Val Ala Ala Asn Ser Leu Gly Val
1 5 10 15

Leu Tyr Gly Val Leu Ala Cys Pro Ser Val Xaa Pro His Gln Ile Ala
20 25 30

Ala Gly Leu Leu Gly Leu Leu Trp Ile Gln Ser Ala Tyr Ile Gly Xaa
35 40 45

Asp Ser Gly His Tyr Val Ile Met Ser Asn Lys Ser Asn Asn Xaa Phe
50 55 60

Ala Gln Leu Leu Ser Gly Asn Cys Leu Thr Gly Ile Ile Ala Trp Trp
65 70 75 80

Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr
85 90 95

Gly Pro Asn Leu Gln His Ile Pro
100

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 252 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Gly Val Leu Tyr Gly Val Leu Ala Cys Thr Ser Val Phe Ala His Gln
1 5 10 15

Ile Ala Ala Ala Leu Leu Gly Leu Leu Trp Ile Gln Ser Ala Tyr Ile
20 25 30

Gly His Asp Ser Gly His Tyr Val Ile Met Ser Asn Lys Ser Tyr Asn
35 40 45

Arg Phe Ala Gln Leu Leu Ser Gly Asn Cys Leu Thr Gly Ile Ser Ile
50 55 60

Ala Trp Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser
65 70 75 80

Leu Asp Tyr Asp Pro Asp Leu Gln His Ile Pro Val Phe Ala Val Ser
85 90 95

Thr Lys Phe Phe Ser Ser Leu Thr Ser Arg Phe Tyr Asp Arg Lys Leu
100 105 110

Thr Phe Gly Pro Val Ala Arg Phe Leu Val Ser Tyr Gln His Phe Thr
115 120 125

Tyr Tyr Pro Val Asn Cys Phe Gly Arg Ile Asn Leu Phe Ile Gln Thr
130 135 140

Phe Leu Leu Leu Phe Ser Lys Arg Glu Val Pro Asp Arg Ala Leu Asn
145 150 155 160

Phe Ala Gly Ile Leu Val Phe Trp Thr Trp Phe Pro Leu Leu Val Ser
165 170 175

Cys Leu Pro Asn Trp Pro Glu Arg Phe Phe Phe Val Phe Thr Ser Phe
180 185 190

Thr Val Thr Ala Leu Gln His Ile Gln Phe Thr Leu Asn His Phe Ala
195 200 205

Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp Trp Phe Glu Lys
210 215 220

Gln Ala Ala Gly Thr Ile Asp Ile Ser Cys Arg Ser Tyr Met Asp Trp
225 230 235 240

Phe Phe Gly Gly Leu Gln Phe Gln Leu Glu His His
245 250

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 125 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Gly Xaa Xaa Asn Phe Ala Gly Ile Leu Val Phe Trp Thr Trp Phe Pro
1 5 10 15

Leu Leu Val Ser Cys Leu Pro Asn Trp Pro Glu Arg Phe Xaa Phe Val
20 25 30

Phe Thr Gly Phe Thr Val Thr Ala Leu Gln His Ile Gln Phe Thr Leu
35 40 45

Asn His Phe Ala Ala Asp Val Tyr Val Gly Pro Pro Thr Gly Ser Asp
50 55 60

Trp Phe Glu Lys Gln Ala Ala Gly Thr Ile Asp Ile Ser Cys Arg Ser
65 70 75 80

Tyr Met Asp Trp Phe Phe Cys Gly Leu Gln Phe Gln Leu Glu His His
85 90 95

Leu Phe Pro Arg Leu Pro Arg Cys His Leu Arg Lys Val Ser Pro Val
100 105 110

Gly Gln Arg Gly Phe Gln Arg Lys Xaa Asn Leu Ser Xaa
115 120 125

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 131 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Pro Ala Thr Glu Val Gly Gly Leu Ala Trp Met Ile Thr Phe Tyr Val
1 5 10 15

Arg Phe Phe Leu Thr Tyr Val Pro Leu Leu Gly Leu Lys Ala Phe Leu
20 25 30

Gly Leu Phe Phe Ile Val Arg Phe Leu Glu Ser Asn Trp Phe Val Trp
35 40 45

Val Thr Gln Met Asn His Ile Pro Met His Ile Asp His Asp Arg Asn
50 55 60

Met Asp Trp Val Ser Thr Gln Leu Gln Ala Thr Cys Asn Val His Lys
65 70 75 80

Ser Ala Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu
85 90 95

His His Leu Phe Pro Thr Met Pro Arg His Asn Tyr His Xaa Val Ala
100 105 110

Pro Leu Val Gln Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser
115 120 125

Lys Pro Leu
130

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Cys Ser Pro Lys Ser Ser Pro Thr Arg Asn Met Thr Pro Ser Pro Phe
1 5 10 15

Ile Asp Trp Leu Trp Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu
20 25 30

Phe Pro Thr Met Pro Arg Cys Asn Leu Asn Arg Cys Met Lys Tyr Val
35 40 45

Lys Glu Trp Cys Ala Glu Asn Asn Leu Pro Tyr Leu Val Asp Asp Tyr
50 55 60

Phe Val Gly Tyr Asn Leu Asn Leu Gln Gln Leu Lys Asn Met Ala Glu
65 70 75 80

Leu Val Gln Ala Lys Ala Ala
85

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 143 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Arg His Glu Ala Ala Arg Gly Gly Thr Arg Leu Ala Tyr Met Leu Val
1 5 10 15

Cys Met Gln Trp Thr Asp Leu Leu Trp Ala Ala Ser Phe Tyr Ser Arg
20 25 30

Phe Phe Leu Ser Tyr Ser Pro Phe Tyr Gly Ala Thr Gly Thr Leu Leu
35 40 45

Leu Phe Val Ala Val Arg Val Leu Glu Ser His Trp Phe Val Trp Ile
50 55 60

Thr Gln Met Asn His Ile Pro Lys Glu Ile Gly His Glu Lys His Arg
65 70 75 80

Asp Trp Ala Ser Ser Gln Leu Ala Ala Thr Cys Asn Val Glu Pro Ser
85 90 95

Leu Phe Ile Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu His
100 105 110

His Leu Phe Pro Thr Met Thr Arg His Asn Tyr Arg Xaa Val Ala Pro
115 120 125

Leu Val Lys Ala Phe Cys Ala Lys His Gly Leu His Tyr Glu Val
130 135 140
(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCAAGCTTCT GCAGGAGCTC TTTTTTTTTT TTTTT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CUACUACUAC UAGGAGTCCT CTACGGTGTT TTG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAUCAUCAUC AUATGATGCT CAAGCTGAAA CTG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TACCAACTCG AGAAAATGGC TGCTGCTCCC AGTGTGAGG
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AACTGATCTA GATTACTGCG CCTTACCCAT CTTGGAGGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TACCAACTCG AGAAAATGGC ACCTCCCAAC ACTATCGAT 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AACTGATCTA GATTACTTCT TGAAAAAGAC CACGTCTCC 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 746 nucleic acids

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CGTATGTCAC TCCATTCCAA ACTCGTTCAT GGTATCATAA ATATCAACAC ATTTACGCTC 60 

CACTCCTCTA TGGTATTTAC ACACTCAAAT ATCGTACTCA AGATTGGGAA GCTTTTGTAA 120
AGGATGGTAA AAATGGTGCA ATTCGTGTTA GTGTCGCCAC AAATTTCGAT AAGGCCGCTT 180
ACGTCATTGG TAAATTGTCT TTTGTTTTCT TCCGTTTCAT CCTTCCACTC CGTTATCATA 240
GCTTTACAGA TTTAATTTGT TATTTCCTCA TTGCTGAATT CGTCTTTGGT TGGTATCTCA 300
CAATTAATTT CCAAGTTAGT CATGTCGCTG AAGATCTCAA ATTCTTTGCT ACCCCTGAAA 360
GACCAGATGA ACCATCTCAA ATCAATGAAG ATTGGGCAAT CCTTCAACTT AAAACTACTC 420
AAGATTATGG TCATGGTTCA CTCCTTTGTA CCTTTTTTAG TGGTTCTTTA AATCATCAAG 480
TTGTTCATCA TTTATTCCCA TCAATTGCTC AAGATTTCTA CCCACAACTT GTACCAATTG 540
TAAAAGAAGT TTGTAAAGAA CATAACATTA CTTACCACAT TAAACCAAAC TTCACTGAAG 600
CTATTATGTC ACACATTAAT TACCTTTACA AAATGGGTAA TGATCCAGAT TATGTTAAAA 660
AACCATTAGC CTCAAAAGAT GATTAAATGA AATAACTTAA AAACCAATTA TTTACTTTTG 720
ACAAACAGTA ATATTAATAA ATACAA 746

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 227 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



Tyr Val Thr Pro Phe Gln Thr Arg Ser Trp Tyr His Lys Tyr Gln
1 5 10 15

His Ile Tyr Ala Pro Leu Leu Tyr Gly Ile Tyr Thr Leu Lys Tyr
20 25 30

Arg Thr Gln Asp Trp Glu Ala Phe Val Lys Asp Gly Lys Asn Gly
35 40 45

Ala Ile Arg Val Ser Val Ala Thr Asn Phe Asp Lys Ala Ala Tyr
50 55 60

Val Ile Gly Lys Leu Ser Phe Val Phe Phe Arg Phe Ile Leu Pro
65 70 75

Leu Arg Tyr His Ser Phe Thr Asp Leu Ile Cys Tyr Phe Leu Ile
80 85 90

Ala Glu Phe Val Phe Gly Trp Tyr Leu Thr Ile Asn Phe Gln Val
95 100 105

Ser His Val Ala Glu Asp Leu Lys Phe Phe Ala Thr Pro Glu Arg
110 115 120

Pro Asp Glu Pro Ser Gln Ile Asn Glu Asp Trp Ala Ile Leu Gln
125 130 135

Leu Lys Thr Thr Gln Asp Tyr Gly His Gly Ser Leu Leu Cys Thr
140 145 150

Phe Phe Ser Gly Ser Leu Asn His Gln Val Val His His Leu Phe
155 160 165

Pro Ser Ile Ala Gln Asp Phe Tyr Pro Gln Leu Val Pro Ile Val
170 175 180

Lys Glu Val Cys Lys Glu His Asn Ile Thr Tyr His Ile Lys Pro
185 190 195

Asn Phe Thr Glu Ala Ile Met Ser His Ile Asn Tyr Leu Tyr Lys
200 205 210

Met Gly Asn Asp Pro Asp Tyr Val Lys Lys Pro Leu Ala Ser Lys
215 220 225

Asp Asp Xaa



(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 494 nucleic acids

(B) TYPE: nucleic acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: nucleic acid

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

TTTTGGAAGG NTCCAAGTTN ACCACGGANT NGGCAAGTTN ACGGGGCGGA AANCGGTTTT 60
CCCCCCAAGC CTTTTGTCGA CTGGTTCTGT GGTGGCTTCC AGTACCAAGT CGACCACCAC 120
TTATTCCCCA GCCTGCCCCG ACACAATCTG GCCAAGACAC ACGCACTGGT CGAATCGTTC 180
TGCAAGGAGT GGGGTGTCCA GTACCACGAA GCCGACCTCG TGGACGGGAC CATGGAAGTC 240
TTGCACCATT TGGGCAGCGT GGCCGGCGAA TTCGTCGTGG ATTTTGTACG CGACGGACCC 300
GCCATGTAAT CGTCGTTCGT GACGATGCAA GGGTTCACGC ACATCTACAC ACACTCACTC 360
ACACAACTAG TGTAACTCGT ATAGAATTCG GTGTCGACCT GGACCTTGTT TGACTGGTTG 420
GGGATAGGGT AGGTAGGCGG ACGCGTGGGT CGNCCCCGGG AATTCTGTGA CCGGTACCTG 480
GCCCGCGTNA AAGT 494

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:



Phe Trp Lys Xaa Pro Ser Xaa Pro Arg Xaa Xaa Gln Val Xaa Gly
1 5 10 15

Ala Glu Xaa Gly Phe Pro Pro Lys Pro Phe Val Asp Trp Phe Cys
20 25 30

Gly Gly Phe Gln Tyr Gln Val Asp His His Leu Phe Pro Ser Leu
35 40 45

Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val Glu Ser Phe
50 55 60

Cys Lys Glu Trp Gly Val Gln Tyr His Glu Ala Asp Leu Val Asp
65 70 75

Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly Glu
65 70 75

Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met
80 85



(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 nucleic acids 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

GGATGGAGTT CGTCTGGATC GCTGTGCGCT ACGCGACGTG GTTTAAGCGT CATGGGTGCG 60

CTTGGGTACA CGCCGGGGCA GTCGTTGGGC ATGTACTTGT GCGCCTTTGG TCTCGGCTGC 120 

ATTTACATTT TTCTGCAGTT CGCCGTAAGT CACACCCATT TGCCCGTGAG CAACCCGGAG 180

GATCAGCTGC ATTGGCTCGA GTACGCGCGG ACCACACTGT GAACATCAGC ACCAAGTCGT 240

GGTTTGTCAC ATGGTGGATG TCGAACCTCA ACTTTCAGAT CGAGCACCAC CTTTTCCCCA 300 

CGGCGCCCCA GTTCCGTTTC AAGGAGATCA GCCCGCGCGT CGAGGCCCTC TTCAAGCGCC 360 

ACGGTCTCCC TTACTACGAC ATGCCCTACA CGAGCGCCGT CTCCACCACC TTTGCCAACC 420 

TCTACTCCGT CGGCCATTCC GTCGGCGACG CCAAGCGCGA CTAGCCTCTT TTCCTAGACC 480

TTAATTCCCC ACCCCACCCC ATGTTCTGTC TTCCTCCCGC 520 






(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 



Met Glu Phe Val Trp Ile Ala Val Arg Tyr Ala Thr Trp Phe Lys
1 5 10 15

Arg His Gly Cys Ala Trp Val His Ala Gly Ala Val Val Gly His
20 25 30

Val Leu Val Arg Leu Trp Ser Arg Leu His Leu His Phe Ser Ala
35 40 45

Val Arg Arg Lys Ser His Pro Phe Ala Arg Glu Gln Pro Gly Gly
50 55 60

Ser Ala Ala Leu Ala Arg Val Arg Ala Asp His Thr Val Asn Ile
65 70 75

Ser Thr Lys Ser Trp Phe Val Thr Trp Trp Met Ser Asn Leu Asn
80 85 90

Phe Gln Ile Glu His His Leu Phe Pro Thr Ala Pro Gln Phe Arg
95 100 105

Phe Lys Glu Ile Ser Pro Arg Val Glu Ala Leu Phe Lys Arg His
110 115 120

Gly Leu Pro Tyr Tyr Asp Met Pro Tyr Thr Ser Ala Val Ser Thr
125 130 135

Thr Phe Ala Asn Leu Tyr Ser Val Gly His Ser Val Gly Asp Ala
140 145 150

Lys Arg Asp



























(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 nucleic acids 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: not relevant

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ACGCGTCCGC CCACGCGTCC GCCGCGAGCA ACTCATCAAG GAAGGCTACT TTGACCCCTC 60 
GCTCCCGCAC ATGACGTACC GCGTGGTCGA GATTGTTGTT CTCTTCGTGC TTTCCTTTTG 120

GCTGATGGGT CAGTCTTCAC CCCTCGCGCT CGCTCTCGGC ATTGTCGTCA GCGGCATCTC 180 

TCAGGGTCGC TGCGGCTGGG TAATGCATGA GATGGGCCAT GGGTCGTTCA CTGGTGTCAT 240 

TTGGCTTGAC GACCGGTTGT GCGAGTTCTT TTACGGCGTT GGTTGTGGCA TGAGCGGTCA 300 

TTACTGGAAA AACCAGCACA GCAAACACCA CGCAGCGCCA AACCGGCTCG AGCACGATGT 360 

AGATCTCAAC ACCTTGCCAT TGGTGGCCTT CAACGAGCGC GTCGTGCGCA AGGTCCGACC 420



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Arg Val Arg Pro Arg Val Arg Arg Glu Gln Leu Ile Lys Glu Gly
1 5 10 15

Tyr Phe Asp Pro Ser Leu Pro His Met Thr Tyr Arg Val Val Glu
20 25 30

Ile Val Val Leu Phe Val Leu Ser Phe Trp Leu Met Gly Gln Ser
35 40 45

Ser Pro Leu Ala Leu Ala Leu Gly Ile Val Val Ser Gly Ile Ser
50 55 60

Gln Gly Arg Cys Gly Trp Val Met His Glu Met Gly His Gly Ser
65 70 75

Phe Thr Gly Val Ile Trp Leu Asp Asp Arg Leu Cys Glu Phe Phe
65 70 75

Tyr Gly Val Gly Cys Gly Met Ser Gly His Tyr Trp Lys Asn Gln
80 85 90

His Ser Lys His His Ala Ala Pro Asn Arg Leu Glu His Asp Val
95 100 105

Asp Leu Asn Thr Leu Pro Leu Val Ala Phe Asn Glu Arg Val Val
110 115 120

Arg Lys Val Arg Pro
125






















(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2692004) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

GCACGCCGAC CGGCGCCGGG AGATCCTGGC AAAGTATCCA GAGATAAAGT CCTTGATGAA 60 

ACCTGATCCC AATTTGATAT GGATTATAAT TATGATGGTT CTCACCCAGT TGGGTGCATT 120 

TTACATAGTA AAAGACTTGG ACTGGAAATG GGTCATATTT GGGGCCTATG CGTTTGGCAG 180 

TTGCATTAAC CACTCAATGA CTCTGGCTAT TCATGAGATT GCCCACAATG CTGCCTTTGG 240
CAACTGCAAA GCAATGTGGA ATCGCTGGTT TGGAATGTTT GCTAATCTTC CTATTGGGAT 300
TCCATATTCA ATTTCCTTTA AGAGGTATCA CATGGATCAT CATCGGTACC TTGGAGCTGA 360
TGGCGTCGAT GTAGATATTC CTACCGATTT TGAGGGCTGG TTCTTCTGTA CCGCTTTCAG 420
AAAGTTTATA TGGGTTATTC TTCAGCCTCT CTTTTATGCC TTTCGACCTC TGTTCATCAA 480
CCCCAAACCA ATTACGTATC TGGAAGTTAT CAATACCGTG GCACAGGTCA CTTTTGACAT 540
TTTAATTTAT TACTTTTTGG GAATTAAATC CTTAGTCTAC ATGTTGGCAG CATCTTTACT 600
TGGCCTGGGT TTGCACCCAA TTTCTGGACA TTTTATAGCT GAGCATTACA TGTTCTTAAA 660
GGGTCATGAA ACTTACTCAT ATTATGGGCC TCTGAATTTA CTTACCTTCA ATGTGGGTTA 720
TCATAATGAA CATCATGATT TCCCCAACAT TCCTGGAAAA AGTCTTCCAC TGGTGAGGAA 780
AATAGCAGCT GAATACTATG ACAACCTCCC TCACTACAAT TCCTGGATAA AAGTACTGTA 840
TGATTTTGTG ATGGATGATA CAATAAGTCC CTACTCAAGA ATGAAGAGGC ACCAAAAAGG 900
AGAGATGGTG CTGGAGTAAA TATCATTAGT GCCAAAGGGA TTCTTCTCCA AAACTTTAGA 960
TGATAAAATG GAATTTTTGC ATTATTAAAC TTGAGACCAG TGATGCTCAG AAGCTCCCCT 1020
GGCACAATTT CAGAGTAAGA GCTCGGTGAT ACCAAGAAGT GAATCTGGCT TTTAAACAGT 1080
CAGCCTGACT CTGTACTGCT CAGTTTCACT CACAGGAAAC TTGTGACTTG TGTATTATCG 1140
TCATTGAGGA TGTTTCACTC ATGTCTGTCA TTTTATAAGC ATATCATTTA AAAAGCTTCT 1200
AAAAAGCTAT TTCGCCAGG 1219
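
SEQ ID NO: 34 is described in this listing as a translation of Contig 2692004, the contig given here as SEQ ID NO: 27. The following is a minimal sketch, not part of the filed listing, of how such a translation can be checked. It assumes the standard genetic code; the names translate, CODON_TABLE and SEQ27_START, and the choice of the second reading frame, are illustrative assumptions. The sketch prints stop codons as "*" and codons containing an ambiguous base as "X", whereas the listing appears to write both as Xaa.

```python
# Sketch only: standard genetic code (NCBI translation table 1), built from the
# conventional TCAG ordering. All names here are illustrative assumptions and
# do not come from the sequence listing.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str, frame: int = 0) -> str:
    """Translate DNA in a 0-based reading frame; unknown codons become 'X'."""
    # Drop the spaces that separate the 10-base groups in the listing.
    seq = "".join(nucleotides.split()).upper()
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 60 bases of SEQ ID NO: 27 as printed above.
SEQ27_START = "GCACGCCGAC CGGCGCCGGG AGATCCTGGC AAAGTATCCA GAGATAAAGT CCTTGATGAA"

print(translate(SEQ27_START, frame=1))  # HADRRREILAKYPEIKSLM
```

Read in the second frame, the output begins His Ala Asp Arg Arg Arg Glu Ile, which agrees with the opening residues listed for SEQ ID NO: 34.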

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2153526) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

TTACCTTCTA CGTCCGCTTC TTCCTCACTT ATGTGCCACT ATTGGGGCTG AAAGCTTCCT 60 

GGGCCTTTTC TTCATAGTCA GGTTCCTGGA AAGCAACTGG TTTGTGTGGG TGACACAGAT 120 

GAACCATATT CCCATGCACA TTGATCATGA CCGGAACATG GACTGGGTTT CCACCCAGCT 180 

CCAGGCCACA TGCAATGTCC ACAAGTCTGC CTTCAATGAC TGGTTCAGTG GACACCTCAA 240
CTTCCAGATT GAGCACCATC TTTTTCCCAC GATGCCTCGA CACAATTACC ACAAAGTGGC 300
TCCCCTGGTG CAGTCCTTGT GTGCCAAGCA TGGCATAGAG TACCAGTCCA AGCCCCTGCT 360
GTCAGCCTTC GCCGACATCA TCCACTCACT AAAGGAGTCA GGGCAGCTCT GGCTAGATGC 420
CTATCTTCAC CAATAACAAC AGCCACCCTG CCCAGTCTGG AAGAAGAGGA GGAAGACTCT 480
GGAGCCAAGG CAGAGGGGAG CTTGAGGGAC AATGCCACTA TAGTTTAATA CTCAGAGGGG 540
GTTGGGTTTG GGGACATAAA GCCTCTGACT CAAACTCCTC CCTTTTATCT TCTAGCCACA 600
GTTCTAAGAC CCAAAGTGGG GGGTGGACAC AGAAGTCCCT AGGAGGGAAG GAGCT 655

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 304 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3506132)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60 

TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120
CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180
AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240
CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300
AAGA 304



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 918 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 3854933)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60 

GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120
CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180
GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240
CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300
CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360
CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420
TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGGCC 480
CAGGCTGGCT GGCTGCAGCA TGACTTTGGG CACCTGTCGG TCTTCAGCAC CTCAAAGTGG 540
AACCATCTGC TACATCATTT TGTGATTGGC CACCTGAAGG GGGCCCCCGC CAGTTGGTGG 600
AACCACATGC ACTTCCAGCA CCATGCCAAG CCCAACTGCT TCCGCAAAGA CCCAGACATC 660
AACATGCATC CCTTCTTCTT TGCCTTGGGG AAGATCCTCT CTGTGGAGCT TGGGAAACAG 720

AAGAAAAAAT ATATGCCGTA CAACCACCAG CACARATACT TCTTCCTAAT TGGGCCCCCA 780 

GCCTTGCTGC CTCTCTACTT CCAGTGGTAT ATTTTCTATT TTGTTATCCA GCGAAAGAAG 840 

TGGGTGGACT TGGCCTGGAT CAGCAAACAG GAATACGATG AAGCCGGGCT TCCATTGTCC 900 

ACCGCAAATG CTTCTAAA 918 

(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1686 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 2511785)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 



GCCACTTAAA GGGTGCCTCT GCCAACTGGT GGAATCATCG CCACTTCCAG CACCACGCCA 60
AGCCTAACAT CTTCCACAAG GATCCCGATG TGAACATGCT GCACGTGTTT GTTCTGGGCG 120
AATGGCAGCC CATCGAGTAC GGCAAGAAGA AGCTGAAATA CCTGCCCTAC AATCACCAGC 180
ACGAATACTT CTTCCTGATT GGGCCGCCGC TGCTCATCCC CATGTATTTC CAGTACCAGA 240
TCATCATGAC CATGATCGTC CATAAGAACT GGGTGGACCT GGCCTGGGCC GTCAGCTACT 300
ACATCCGGTT CTTCATCACC TACATCCCTT TCTACGGCAT CCTGGGAGCC CTCCTTTTCC 360
TCAACTTCAT CAGGTTCCTG GAGAGCCACT GGTTTGTGTG GGTCACACAG ATGAATCACA 420
TCGTCATGGA GATTGACCAG GAGGCCTACC GTGACTGGTT CAGTAGCCAG CTGACAGCCA 480
CCTGCAACGT GGAGCAGTCC TTCTTCAACG ACTGGTTCAG TGGACACCTT AACTTCCAGA 540
TTGAGCACCA CCTCTTCCCC ACCATGCCCC GGCACAACTT ACACAAGATC GCCCCGCTGG 600
TGAAGTCTCT ATGTGCCAAG CATGGCATTG AATACCAGGA GAAGCCGCTA CTGAGGGCCC 660
TGCTGGACAT CATCAGGTCC CTGAAGAAGT CTGGGAAGCT GTGGCTGGAC GCCTACCTTC 720
ACAAATGAAG CCACAGCCCC CGGGACACCG TGGGGAAGGG GTGCAGGTGG GGTGATGGCC 780
AGAGGAATGA TGGGCTTTTG TTCTGAGGGG TGTCCGAGAG GCTGGTGTAT GCACTGCTCA 840
CGGACCCCAT GTTGGATCTT TCTCCCTTTC TCCTCTCCTT TTTCTCTTCA CATCTCCCCC 900
ATAGCACCCT GCCCTCATGG GACCTGCCCT CCCTCAGCCG TCAGCCATCA GCCATGGCCC 960
TCCCAGTGCC TCCTAGCCCC TTCTTCCAAG GAGCAGAGAG GTGGCCACCG GGGGTGGCTC 1020
TGTCCTACCT CCACTCTCTG CCCCTAAAGA TGGGAGGAGA CCAGCGGTCC ATGGGTCTGG 1080
CCTGTGAGTC TCCCCTTGCA GCCTGGTCAC TAGGCATCAC CCCCGCTTTG GTTCTTCAGA 1140
TGCTCTTGGG GTTCATAGGG GCAGGTCCTA GTCGGGCAGG GCCCCTGACC CTCCCGGCCT 1200
GGCTTCACTC TCCCTGACGG CTGCCATTGG TCCACCCTTT CATAGAGAGG CCTGCTTTGT 1260
TACAAAGCTC GGGTCTCCCT CCTGCAGCTC GGTTAAGTAC CCGAGGCCTC TCTTAAGATG 1320
TCCAGGGCCC CAGGCCCGCG GGCACAGCCA GCCCAAACCT TGGGCCCTGG AAGAGTCCTC 1380
CACCCCATCA CTAGAGTGCT CTGACCCTGG GCTTTCACGG GCCCCATTCC ACCGCCTCCC 1440
CAACTTGAGC CTGTGACCTT GGGACCAAAG GGGGAGTCCC TCGTCTCTTG TGACTCAGCA 1500
GAGGCAGTGG CCACGTTCAG GGAGGGGCCG GCTGGCCTGG AGGCTCAGCC CACCCTCCAG 1560
CTTTTCCTCA GGGTGTCCTG AGGTCCAAGA TTCTGGAGCA ATCTGACCCT TCTCCAAAGG 1620
CTCTGTTATC AGCTGGGCAG TGCCAGCCAA TCCCTGGCCA TTTGGCCCCA GGGGACGTGG 1680
GCCCTG 1686



(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Contig 2535)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:





GTCTTTTACT TTGGCAATGG CTGGATTCCT ACCCTCATCA CGGCCTTTGT CCTTGCTACC 60
TCTCAGGCCC AAGCTGGATG GCTGCAACAT GATTATGGCC ACCTGTCTGT CTACAGAAAA 120
CCCAAGTGGA ACCACCTTGT CCACAAATTC GTCATTGGCC ACTTAAAGGG TGCCTCTGCC 180
AACTGGTGGA ATCATCGCCA CTTCCAGCAC CACGCCAAGC CTAACATCTT CCACAAGGAT 240
CCCGATGTGA ACATGCTGCA CGTGTTTGTT CTGGGCGAAT GGCAGCCCAT CGAGTACGGC 300
AAGAAGAAGC TGAAATACCT GCCCTACAAT CACCAGCACG AATACTTCTT CCTGATTGGG 360
CCGCCGCTGC TCATCCCCAT GTATTTCCAG TACCAGATCA TCATGACCAT GATCGTCCAT 420
AAGAACTGGG TGGACCTGGC CTGGGCCGTC AGCTACTACA TCCGGTTCTT CATCACCTAC 480
ATCCCTTTCT ACGGCATCCT GGGAGCCCTC CTTTTCCTCA ACTTCATCAG GTTCCTGGAG 540
AGCCACTGGT TTGTGTGGGT CACACAGATG AATCACATCG TCATGGAGAT TGACCAGGAG 600
GCCTACCGTG ACTGGTTCAG TAGCCAGCTG ACAGCCACCT GCAACGTGGA GCAGTCCTTC 660
TTCAACGACT GGTTCAGTGG ACACCTTAAC TTCCAGATTG AGCACCACCT CTTCCCCACC 720
ATGCCCCGGC ACAACTTACA CAAGATCGCC CCGCTGGTGA AGTCTCTATG TGCCAAGCAT 780
GGCATTGAAT ACCAGGAGAA GCCGCTACTG AGGGCCCTGC TGGACATCAT CAGGTCCCTG 840
AAGAAGTCTG GGAAGCTGTG GCTGGACGCC TACCTTCACA AATGAAGCCA CAGCCCCCGG 900
GACACCGTGG GGAAGGGGTG CAGGTGGGGT GATGGCCAGA GGAATGATGG GCTTTTGTTC 960
TGAGGGGTGT CCGAGAGGCT GGTGTATGCA CTGCTCACGG ACCCCATGTT GGATCTTTCT 1020
CCCTTTCTCC TCTCCTTTTT CTCTTCACAT CTCCCCCATA GCACCCTGCC CTCATGGGAC 1080
CTGCCCTCCC TCAGCCGTCA GCCATCAGCC ATGGCCCTCC CAGTGCCTCC TAGCCCCTTC 1140
TTCCAAGGAG CAGAGAGGTG GCCACCGGGG GTGGCTCTGT CCTACCTCCA CTCTCTGCCC 1200
CTAAAGATGG GAGGAGACCA GCGGTCCATG GGTCTGGCCT GTGAGTCTCC CCTTGCAGCC 1260
TGGTCACTAG GCATCACCCC CGCTTTGGTT CTTCAGATGC TCTTGGGGTT CATAGGGGCA 1320
GGTCCTAGTC GGGCAGGGCC CCTGACCCTC CCGGCCTGGC TTCACTCTCC CTGACGGCTG 1380
CCATTGGTCC ACCCTTTCAT AGAGAGGCCT GCTTTGTTAC AAAGCTCGGG TCTCCCTCCT 1440
GCAGCTCGGT TAAGTACCCG AGGCCTCTCT TAAGATGTCC AGGGCCCCAG GCCCGCGGGC 1500
ACAGCCAGCC CAAACCTTGG GCCCTGGAAG AGTCCTCCAC CCCATCACTA GAGTGCTCTG 1560
ACCCTGGGCT TTCACGGGCC CCATTCCACC GCCTCCCCAA CTTGAGCCTG TGACCTTGGG 1620
ACCAAAGGGG GAGTCCCTCG TCTCTTGTGA CTCAGCAGAG GCAGTGGCCA CGTTCAGGGA 1680
GGGGCCGGCT GGCCTGGAGG CTCAGCCCAC CCTCCAGCTT TTCCTCAGGG TGTCCTGAGG 1740
TCCAAGATTC TGGAGCAATC TGACCCTTCT CCAAAGGCTC TGTTATCAGC TGGGCAGTGC 1800
CAGCCAATCC CTGGCCATTT GGCCCCAGGG GACGTGGGCC CTG 1843



(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid (Edited Contig 253538a)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CAGGGACCTA CCCCGCGCTA CTTCACCTGG GACGAGGTGG CCCAGCGCTC AGGGTGCGAG 60
GAGCGGTGGC TAGTGATCGA CCGTAAGGTG TACAACATCA GCGAGTTCAC CCGCCGGCAT 120
CCAGGGGGCT CCCGGGTCAT CAGCCACTAC GCCGGGCAGG ATGCCACGGA TCCCTTTGTG 180
GCCTTCCACA TCAACAAGGG CCTTGTGAAG AAGTATATGA ACTCTCTCCT GATTGGAGAA 240
CTGTCTCCAG AGCAGCCCAG CTTTGAGCCC ACCAAGAATA AAGAGCTGAC AGATGAGTTC 300
CGGGAGCTGC GGGCCACAGT GGAGCGGATG GGGCTCATGA AGGCCAACCA TGTCTTCTTC 360
CTGCTGTACC TGCTGCACAT CTTGCTGCTG GATGGTGCAG CCTGGCTCAC CCTTTGGGTC 420
TTTGGGACGT CCTTTTTGCC CTTCCTCCTC TGTGCGGTGC TGCTCAGTGC AGTTCAGCAG 480
GCCCAAGCTG GATGGCTGCA ACATGATTAT GGCCACCTGT CTGTCTACAG AAAACCCAAG 540
TGGAACCACC TTGTCCACAA ATTCGTCATT GGCCACTTAA AGGGTGCCTC TGCCAACTGG 600
TGGAATCATC GCCACTTCCA GCACCACGCC AAGCCTAACA TCTTCCACAA GGATCCCGAT 660
GTGAACATGC TGCACGTGTT TGTTCTGGGC GAATGGCAGC CCATCGAGTA CGGCAAGAAG 720
AAGCTGAAAT ACCTGCCCTA CAATCACCAG CACGAATACT TCTTCCTGAT TGGGCCGCCG 780
CTGCTCATCC CCATGTATTT CCAGTACCAG ATCATCATGA CCATGATCGT CCATAAGAAC 840
TGGGTGGACC TGGCCTGGGC CGTCAGCTAC TACATCCGGT TCTTCATCAC CTACATCCCT 900
TTCTACGGCA TCCTGGGAGC CCTCCTTTTC CTCAACTTCA TCAGGTTCCT GGAGAGCCAC 960
TGGTTTGTGT GGGTCACACA GATGAATCAC ATCGTCATGG AGATTGACCA GGAGGCCTAC 1020
CGTGACTGGT TCAGTAGCCA GCTGACAGCC ACCTGCAACG TGGAGCAGTC CTTCTTCAAC 1080
GACTGGTTCA GTGGACACCT TAACTTCCAG ATTGAGCACC ACCTCTTCCC CACCATGCCC 1140
CGGCACAACT TACACAAGAT CGCCCCGCTG GTGAAGTCTC TATGTGCCAA GCATGGCATT 1200
GAATACCAGG AGAAGCCGCT ACTGAGGGCC CTGCTGGACA TCATCAGGTC CCTGAAGAAG 1260
TCTGGGAAGC TGTGGCTGGA CGCCTACCTT CACAAATGAA GCCACAGCCC CCGGGACACC 1320
GTGGGGAAGG GGTGCAGGTG GGGTGATGGC CAGAGGAATG ATGGGCTTTT GTTCTGAGGG 1380
GTGTCCGAGA GGCTGGTGTA TGCACTGCTC ACGGACCCCA TGTTGGATCT TTCTCCCTTT 1440
CTCCTCTCCT TTTTCTCTTC ACATCTCCCC CATAGCACCC TGCCCTCATG GGACCTGCCC 1500
TCCCTCAGCC GTCAGCCATC AGCCATGGCC CTCCCAGTGC CTCCTAGCCC CTTCTTCCAA 1560
GGAGCAGAGA GGTGGCCACC GGGGGTGGCT CTGTCCTACC TCCACTCTCT GCCCCTAAAG 1620
ATGGGAGGAG ACCAGCGGTC CATGGGTCTG GCCTGTGAGT CTCCCCTTGC AGCCTGGTCA 1680
CTAGGCATCA CCCCCGCTTT GGTTCTTCAG ATGCTCTTGG GGTTCATAGG GGCAGGTCCT 1740
AGTCGGGCAG GGCCCCTGAC CCTCCCGGCC TGGCTTCACT CTCCCTGACG GCTGCCATTG 1800
GTCCACCCTT TCATAGAGAG GCCTGCTTTG TTACAAAGCT CGGGTCTCCC TCCTGCAGCT 1860
CGGTTAAGTA CCCGAGGCCT CTCTTAAGAT GTCCAGGGCC CCAGGCCCGC GGGCACAGCC 1920
AGCCCAAACC TTGGGCCCTG GAAGAGTCCT CCACCCCATC ACTAGAGTGC TCTGACCCTG 1980
GGCTTTCACG GGCCCCATTC CACCGCCTCC CCAACTTGAG CCTGTGACCT TGGGACCAAA 2040
GGGGGAGTCC CTCGTCTCTT GTGACTCAGC AGAGGCAGTG GCCACGTTCA GGGAGGGGCC 2100
GGCTGGCCTG GAGGCTCAGC CCACCCTCCA GCTTTTCCTC AGGGTGTCCT GAGGTCCAAG 2160
ATTCTGGAGC AATCTGACCC TTCTCCAAAG GCTCTGTTAT CAGCTGGGCA GTGCCAGCCA 2220
ATCCCTGGCC ATTTGGCCCC AGGGGACGTG GGCCCTG 2257



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2692004) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

His Ala Asp Arg Arg Arg Glu Ile Leu Ala Lys Tyr Pro Glu Ile
1 5 10 15

Lys Ser Leu Met Lys Pro Asp Pro Asn Leu Ile Trp Ile Ile Ile
20 25 30

Met Met Val Leu Thr Gln Leu Gly Ala Phe Tyr Ile Val Lys Asp
35 40 45

Leu Asp Trp Lys Trp Val Ile Phe Gly Ala Tyr Ala Phe Gly Ser
50 55 60

Cys Ile Asn His Ser Met Thr Leu Ala Ile His Glu Ile Ala His
65 70 75

Asn Ala Ala Phe Gly Asn Cys Lys Ala Met Trp Asn Arg Trp Phe
80 85 90

Gly Met Phe Ala Asn Leu Pro Ile Gly Ile Pro Tyr Ser Ile Ser
95 100 105

Phe Lys Arg Tyr His Met Asp His His Arg Tyr Leu Gly Ala Asp
110 115 120

Gly Val Asp Val Asp Ile Pro Thr Asp Phe Glu Gly Trp Phe Phe
125 130 135

Cys Thr Ala Phe Arg Lys Phe Ile Trp Val Ile Leu Gln Pro Leu
140 145 150

Phe Tyr Ala Phe Arg Pro Leu Phe Ile Asn Pro Lys Pro Ile Thr
155 160 165

Tyr Leu Glu Val Ile Asn Thr Val Ala Gln Val Thr Phe Asp Ile
170 175 180

Leu Ile Tyr Tyr Phe Leu Gly Ile Lys Ser Leu Val Tyr Met Leu
185 190 195

Ala Ala Ser Leu Leu Gly Leu Gly Leu His Pro Ile Ser Gly His
200 205 210

Phe Ile Ala Glu His Tyr Met Phe Leu Lys Gly His Glu Thr Tyr
215 220 225

Ser Tyr Tyr Gly Pro Leu Asn Leu Leu Thr Phe Asn Val Gly Tyr
230 235 240

His Asn Glu His His Asp Phe Pro Asn Ile Pro Gly Lys Ser Leu
245 250 255

Pro Leu Val Arg Lys Ile Ala Ala Glu Tyr Tyr Asp Asn Leu Pro
260 265 270

His Tyr Asn Ser Trp Ile Lys Val Leu Tyr Asp Phe Val Met Asp
275 280 285

Asp Thr Ile Ser Pro Tyr Ser Arg Met Lys Arg His Gln Lys Gly
290 295 300

Glu Met Val Leu Glu Xaa Ile Ser Leu Val Pro Lys Gly Phe Phe
305 310 315

Ser Lys Thr Leu Asp Asp Lys Met Glu Phe Leu His Tyr Xaa Thr
320 325 330

Xaa Asp Gln Xaa Cys Ser Glu Ala Pro Leu Ala Gln Phe Gln Ser
335 340 345

Lys Ser Ser Val Ile Pro Arg Ser Glu Ser Gly Phe Xaa Thr Val
350 355 360

Ser Leu Thr Leu Tyr Cys Ser Val Ser Leu Thr Gly Asn Leu Xaa
365 370 375

Leu Val Tyr Tyr Arg His Xaa Gly Cys Phe Thr His Val Cys His
380 385 390

Phe Ile Ser Ile Ser Phe Lys Lys Leu Leu Lys Ser Tyr Phe Ala
400 405 410

Arg



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 2153526)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



Tyr Leu Leu Arg Pro Leu Leu Pro His Leu Cys Ala Thr Ile Gly
1 5 10 15

Ala Glu Ser Phe Leu Gly Leu Phe Phe Ile Val Arg Phe Leu Glu
20 25 30

Ser Asn Trp Phe Val Trp Val Thr Gln Met Asn His Ile Pro Met
35 40 45

His Ile Asp His Asp Arg Asn Met Asp Trp Val Ser Thr Gln Leu
50 55 60

Gln Ala Thr Cys Asn Val His Lys Ser Ala Phe Asn Asp Trp Phe
65 70 75

Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
80 85 90

Met Pro Arg His Asn Tyr His Lys Val Ala Pro Leu Val Gln Ser
95 100 105

Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Ser Lys Pro Leu Leu
110 115 120

Ser Ala Phe Ala Asp Ile Ile His Ser Leu Lys Glu Ser Gly Gln
125 130 135

Leu Trp Leu Asp Ala Tyr Leu His Gln Xaa Gln Gln Pro Pro Cys
140 145 150

Pro Val Trp Lys Lys Arg Arg Lys Thr Leu Glu Pro Arg Gln Arg
155 160 165

Gly Ala Xaa Gly Thr Met Pro Leu Xaa Phe Asn Thr Gln Arg Gly
170 175 180

Leu Gly Leu Gly Thr Xaa Ser Leu Xaa Leu Lys Leu Leu Pro Phe
185 190 195

Ile Phe Xaa Pro Gln Phe Xaa Asp Pro Lys Trp Gly Val Asp Thr
200 205 210

Glu Val Pro Arg Arg Glu Gly Ala
215

















(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3506132) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



Val Phe Tyr Phe Gly Asn Gly Trp Ile Pro Thr Leu Ile Thr Ala
1 5 10 15

Phe Val Leu Ala Thr Ser Gln Ala Gln Ala Gly Trp Leu Gln His
20 25 30

Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His
35 40 45

Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly Ala Ser Ala
50 55 60

Asn Trp Trp Asn His Arg His Phe Gln His His Ala Lys Pro Asn
65 70 75

Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Xaa
80 85

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 3854933) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln
1               5                   10                  15

Arg Ser Gly Cys Glu Glu Arg Trp Leu Val Ile Asp Arg Lys Val
                20                  25                  30

Tyr Asn Ile Ser Glu Phe Thr Arg Arg His Pro Gly Gly Ser Arg
                35                  40                  45

Val Ile Ser His Tyr Ala Gly Gln Asp Ala Thr Asp Pro Phe Val
                50                  55                  60

Ala Phe His Ile Asn Lys Gly Leu Val Lys Lys Tyr Met Asn Ser
                65                  70                  75

Leu Leu Ile Gly Glu Leu Ser Pro Glu Gln Pro Ser Phe Glu Pro
                80                  85                  90

Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe Arg Glu Leu Arg Ala
                95                  100                 105

Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn His Val Phe Phe
                110                 115                 120

Leu Leu Tyr Leu Leu His Ile Leu Leu Leu Asp Gly Ala Ala Trp
                125                 130                 135


Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe Leu Leu
                140                 145                 150

Cys Ala Val Leu Leu Ser Ala Val Gln Ala Gln Ala Gly Trp Leu
                155                 160                 165

Gln His Asp Phe Gly His Leu Ser Val Phe Ser Thr Ser Lys Trp
                170                 175                 180

Asn His Leu Leu His His Phe Val Ile Gly His Leu Lys Gly Ala
                185                 190                 195

Pro Ala Ser Trp Trp Asn His Met His Phe Gln His His Ala Lys
                200                 205                 210

Pro Asn Cys Phe Arg Lys Asp Pro Asp Ile Asn Met His Pro Phe
                215                 220                 225

Phe Phe Ala Leu Gly Lys Ile Leu Ser Val Glu Leu Gly Lys Gln
                230                 235                 240

Lys Lys Lys Tyr Met Pro Tyr Asn His Gln His Xaa Tyr Phe Phe
                245                 250                 255

Leu Ile Gly Pro Pro Ala Leu Leu Pro Leu Tyr Phe Gln Trp Tyr
                260                 265                 270

Ile Phe Tyr Phe Val Ile Gln Arg Lys Lys Trp Val Asp Leu Ala
                275                 280                 285

Trp Ile Ser Lys Gln Glu Tyr Asp Glu Ala Gly Leu Pro Leu Ser
                290                 295                 300

Thr Ala Asn Ala Ser Lys
                305























(2) INFORMATION FOR SEQ ID NO: 38:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: amino acid (Translation of Contig 2511785) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



His Leu Lys Gly Ala Ser Ala Asn Trp Trp Asn His Arg His Phe
1               5                   10                  15

Gln His His Ala Lys Pro Asn Ile Phe His Lys Asp Pro Asp Val
                20                  25                  30

Asn Met Leu His Val Phe Val Leu Gly Glu Trp Gln Pro Ile Glu
                35                  40                  45

Tyr Gly Lys Lys Lys Leu Lys Tyr Leu Pro Tyr Asn His Gln His
                50                  55                  60

Glu Tyr Phe Phe Leu Ile Gly Pro Pro Leu Leu Ile Pro Met Tyr
                65                  70                  75

Phe Gln Tyr Gln Ile Ile Met Thr Met Ile Val His Lys Asn Trp
                80                  85                  90

Val Asp Leu Ala Trp Ala Val Ser Tyr Tyr Ile Arg Phe Phe Ile
                95                  100                 105

Thr Tyr Ile Pro Phe Tyr Gly Ile Leu Gly Ala Leu Leu Phe Leu
                110                 115                 120

Asn Phe Ile Arg Phe Leu Glu Ser His Trp Phe Val Trp Val Thr
                125                 130                 135

Gln Met Asn His Ile Val Met Glu Ile Asp Gln Glu Ala Tyr Arg
                140                 145                 150

Asp Trp Phe Ser Ser Gln Leu Thr Ala Thr Cys Asn Val Glu Gln
                155                 160                 165

Ser Phe Phe Asn Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile
                170                 175                 180

Glu His His Leu Phe Pro Thr Met Pro Arg His Asn Leu His Lys
                185                 190                 195

Ile Ala Pro Leu Val Lys Ser Leu Cys Ala Lys His Gly Ile Glu
                200                 205                 210

Tyr Gln Glu Lys Pro Leu Leu Arg Ala Leu Leu Asp Ile Ile Arg
                215                 220                 225

Ser Leu Lys Lys Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His
                230                 235                 240

Lys Xaa Ser His Ser Pro Arg Asp Thr Val Gly Lys Gly Cys Arg
                245                 250                 255

Trp Gly Asp Gly Gln Arg Asn Asp Gly Leu Leu Phe Xaa Gly Val
                260                 265                 270

Ser Glu Arg Leu Val Tyr Ala Leu Leu Thr Asp Pro Met Leu Asp
                275                 280                 285


Leu Ser Pro Phe Leu Leu Ser Phe Phe Ser Ser His Leu Pro His
                290                 295                 300

Ser Thr Leu Pro Ser Trp Asp Leu Pro Ser Leu Ser Arg Gln Pro
                305                 310                 315

Ser Ala Met Ala Leu Pro Val Pro Pro Ser Pro Phe Phe Gln Gly
                320                 325                 330

Ala Glu Arg Trp Pro Pro Gly Val Ala Leu Ser Tyr Leu His Ser
                335                 340                 345

Leu Pro Leu Lys Met Gly Gly Asp Gln Arg Ser Met Gly Leu Ala
                350                 355                 360

Cys Glu Ser Pro Leu Ala Ala Trp Ser Leu Gly Ile Thr Pro Ala
                365                 370                 375

Leu Val Leu Gln Met Leu Leu Gly Phe Ile Gly Ala Gly Pro Ser
                380                 385                 390

Arg Ala Gly Pro Leu Thr Leu Pro Ala Trp Leu His Ser Pro Xaa
                400                 405                 410

Arg Leu Pro Leu Val His Pro Phe Ile Glu Arg Pro Ala Leu Leu
                415                 420                 425

Gln Ser Ser Gly Leu Pro Pro Ala Ala Arg Leu Ser Thr Arg Gly
                430                 435                 440

Leu Ser Xaa Asp Val Gln Gly Pro Arg Pro Ala Gly Thr Ala Ser
                445                 450                 455

Pro Asn Leu Gly Pro Trp Lys Ser Pro Pro Pro His His Xaa Ser
                460                 465                 470

Ala Leu Thr Leu Gly Phe His Gly Pro His Ser Thr Ala Ser Pro
                475                 480                 485

Thr Xaa Ala Cys Asp Leu Gly Thr Lys Gly Gly Val Pro Arg Leu
                490                 495                 500

Leu Xaa Leu Ser Arg Gly Ser Gly His Val Gln Gly Gly Ala Gly
                505                 510                 515

Trp Pro Gly Gly Ser Ala His Pro Pro Ala Phe Pro Gln Gly Val
                520                 525                 530

Leu Arg Ser Lys Ile Leu Glu Gln Ser Asp Pro Ser Pro Lys Ala
                535                 540                 545

Leu Leu Ser Ala Gly Gln Cys Gln Pro Ile Pro Gly His Leu Ala
                550                 555                 560

Pro Gly Asp Val Gly Pro Xaa
                565






















(2) INFORMATION FOR SEQ ID NO: 39:

















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: amino acid (Translation of Contig 2535) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



Val Phe Tyr Phe Gly Asn Gly Trp Ile Pro Thr Leu Ile Thr Ala
1               5                   10                  15

Phe Val Leu Ala Thr Ser Gln Ala Gln Ala Gly Trp Leu Gln His
                20                  25                  30

Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys Trp Asn His
                35                  40                  45

Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly Ala Ser Ala
                50                  55                  60

Asn Trp Trp Asn His Arg His Phe Gln His His Ala Lys Pro Asn
                65                  70                  75

Ile Phe His Lys Asp Pro Asp Val Asn Met Leu His Val Phe Val
                80                  85                  90

Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Lys Lys Leu Lys
                95                  100                 105

Tyr Leu Pro Tyr Asn His Gln His Glu Tyr Phe Phe Leu Ile Gly
                110                 115                 120

Pro Pro Leu Leu Ile Pro Met Tyr Phe Gln Tyr Gln Ile Ile Met
                125                 130                 135

Thr Met Ile Val His Lys Asn Trp Val Asp Leu Ala Trp Ala Val
                140                 145                 150

Ser Tyr Tyr Ile Arg Phe Phe Ile Thr Tyr Ile Pro Phe Tyr Gly
                155                 160                 165

Ile Leu Gly Ala Leu Leu Phe Leu Asn Phe Ile Arg Phe Leu Glu
                170                 175                 180

Ser His Trp Phe Val Trp Val Thr Gln Met Asn His Ile Val Met
                185                 190                 195

Glu Ile Asp Gln Glu Ala Tyr Arg Asp Trp Phe Ser Ser Gln Leu
                200                 205                 210

Thr Ala Thr Cys Asn Val Glu Gln Ser Phe Phe Asn Asp Trp Phe
                215                 220                 225

Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu Phe Pro Thr
                230                 235                 240


Met Pro Arg His Asn Leu His Lys Ile Ala Pro Leu Val Lys Ser
                245                 250                 255

Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Glu Lys Pro Leu Leu
                260                 265                 270

Arg Ala Leu Leu Asp Ile Ile Arg Ser Leu Lys Lys Ser Gly Lys
                275                 280                 285

Leu Trp Leu Asp Ala Tyr Leu His Lys Xaa Ser His Ser Pro Arg
                290                 295                 300

Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly Gln Arg Asn
                305                 310                 315

Asp Gly Leu Leu Phe Xaa Gly Val Ser Glu Arg Leu Val Tyr Ala
                320                 325                 330

Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe Leu Leu Ser
                335                 340                 345

Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro Ser Trp Asp
                350                 355                 360

Leu Pro Ser Leu Ser Arg Gln Pro Ser Ala Met Ala Leu Pro Val
                365                 370                 375

Pro Pro Ser Pro Phe Phe Gln Gly Ala Glu Arg Trp Pro Pro Gly
                380                 385                 390

Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys Met Gly Gly
                400                 405                 410

Asp Gln Arg Ser Met Gly Leu Ala Cys Glu Ser Pro Leu Ala Ala
                415                 420                 425

Trp Ser Leu Gly Ile Thr Pro Ala Leu Val Leu Gln Met Leu Leu
                430                 435                 440

Gly Phe Ile Gly Ala Gly Pro Ser Arg Ala Gly Pro Leu Thr Leu
                445                 450                 455

Pro Ala Trp Leu His Ser Pro Xaa Arg Leu Pro Leu Val His Pro
                460                 465                 470

Phe Ile Glu Arg Pro Ala Leu Leu Gln Ser Ser Gly Leu Pro Pro
                475                 480                 485

Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser Xaa Asp Val Gln Gly
                490                 495                 500

Pro Arg Pro Ala Gly Thr Ala Ser Pro Asn Leu Gly Pro Trp Lys
                505                 510                 515

Ser Pro Pro Pro His His Xaa Ser Ala Leu Thr Leu Gly Phe His
                520                 525                 530

Gly Pro His Ser Thr Ala Ser Pro Thr Xaa Ala Cys Asp Leu Gly
                535                 540                 545

Thr Lys Gly Gly Val Pro Arg Leu Leu Xaa Leu Ser Arg Gly Ser
                550                 555                 560

Gly His Val Gln Gly Gly Ala Gly Trp Pro Gly Gly Ser Ala His
                565                 570                 575

Pro Pro Ala Phe Pro Gln Gly Val Leu Arg Ser Lys Ile Leu Glu
                580                 585                 590

Gln Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala Gly Gln Cys
                595                 600                 605

Gln Pro Ile Pro Gly His Leu Ala Pro Gly Asp Val Gly Pro Xaa
                610                 615                 620



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: amino acid (Translation of Contig 253538a)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:



Gln Gly Pro Thr Pro Arg Tyr Phe Thr Trp Asp Glu Val Ala Gln
1               5                   10                  15

Arg Ser Gly Cys Glu Glu Arg Trp Leu Val Ile Asp Arg Lys Val
                20                  25                  30

Tyr Asn Ile Ser Glu Phe Thr Arg Arg His Pro Gly Gly Ser Arg
                35                  40                  45

Val Ile Ser His Tyr Ala Gly Gln Asp Ala Thr Asp Pro Phe Val
                50                  55                  60

Ala Phe His Ile Asn Lys Gly Leu Val Lys Lys Tyr Met Asn Ser
                65                  70                  75

Leu Leu Ile Gly Glu Leu Ser Pro Glu Gln Pro Ser Phe Glu Pro
                80                  85                  90

Thr Lys Asn Lys Glu Leu Thr Asp Glu Phe Arg Glu Leu Arg Ala
                95                  100                 105

Thr Val Glu Arg Met Gly Leu Met Lys Ala Asn His Val Phe Phe
                110                 115                 120

Leu Leu Tyr Leu Leu His Ile Leu Leu Leu Asp Gly Ala Ala Trp
                125                 130                 135

Leu Thr Leu Trp Val Phe Gly Thr Ser Phe Leu Pro Phe Leu Leu
                140                 145                 150

Cys Ala Val Leu Leu Ser Ala Val Gln Gln Ala Gln Ala Gly Trp
                155                 160                 165

Leu Gln His Asp Tyr Gly His Leu Ser Val Tyr Arg Lys Pro Lys
                170                 175                 180

Trp Asn His Leu Val His Lys Phe Val Ile Gly His Leu Lys Gly
                185                 190                 195

Ala Ser Ala Asn Trp Trp Asn His Arg His Phe Gln His His Ala
                200                 205                 210

Lys Pro Asn Ile Phe His Lys Asp Pro Asp Val Asn Met Leu His
                215                 220                 225

Val Phe Val Leu Gly Glu Trp Gln Pro Ile Glu Tyr Gly Lys Lys
                230                 235                 240

Lys Leu Lys Tyr Leu Pro Tyr Asn His Gln His Glu Tyr Phe Phe
                245                 250                 255

Leu Ile Gly Pro Pro Leu Leu Ile Pro Met Tyr Phe Gln Tyr Gln
                260                 265                 270

Ile Ile Met Thr Met Ile Val His Lys Asn Trp Val Asp Leu Ala
                275                 280                 285

Trp Ala Val Ser Tyr Tyr Ile Arg Phe Phe Ile Thr Tyr Ile Pro
                290                 295                 300

Phe Tyr Gly Ile Leu Gly Ala Leu Leu Phe Leu Asn Phe Ile Arg
                305                 310                 315

Phe Leu Glu Ser His Trp Phe Val Trp Val Thr Gln Met Asn His
                320                 325                 330

Ile Val Met Glu Ile Asp Gln Glu Ala Tyr Arg Asp Trp Phe Ser
                335                 340                 345

Ser Gln Leu Thr Ala Thr Cys Asn Val Glu Gln Ser Phe Phe Asn
                350                 355                 360

Asp Trp Phe Ser Gly His Leu Asn Phe Gln Ile Glu His His Leu
                365                 370                 375

Phe Pro Thr Met Pro Arg His Asn Leu His Lys Ile Ala Pro Leu
                380                 385                 390

Val Lys Ser Leu Cys Ala Lys His Gly Ile Glu Tyr Gln Glu Lys
                400                 405                 410

Pro Leu Leu Arg Ala Leu Leu Asp Ile Ile Arg Ser Leu Lys Lys
                415                 420                 425

Ser Gly Lys Leu Trp Leu Asp Ala Tyr Leu His Lys Xaa Ser His
                430                 435                 440

Ser Pro Arg Asp Thr Val Gly Lys Gly Cys Arg Trp Gly Asp Gly
                445                 450                 455

Gln Arg Asn Asp Gly Leu Leu Phe Xaa Gly Val Ser Glu Arg Leu
                460                 465                 470

Val Tyr Ala Leu Leu Thr Asp Pro Met Leu Asp Leu Ser Pro Phe
                475                 480                 485



Leu Leu Ser Phe Phe Ser Ser His Leu Pro His Ser Thr Leu Pro
                490                 495                 500

Ser Trp Asp Leu Pro Ser Leu Ser Arg Gln Pro Ser Ala Met Ala
                505                 510                 515

Leu Pro Val Pro Pro Ser Pro Phe Phe Gln Gly Ala Glu Arg Trp
                520                 525                 530

Pro Pro Gly Val Ala Leu Ser Tyr Leu His Ser Leu Pro Leu Lys
                535                 540                 545

Met Gly Gly Asp Gln Arg Ser Met Gly Leu Ala Cys Glu Ser Pro
                550                 555                 560

Leu Ala Ala Trp Ser Leu Gly Ile Thr Pro Ala Leu Val Leu Gln
                565                 570                 575

Met Leu Leu Gly Phe Ile Gly Ala Gly Pro Ser Arg Ala Gly Pro
                580                 585                 590

Leu Thr Leu Pro Ala Trp Leu His Ser Pro Xaa Arg Leu Pro Leu
                595                 600                 605

Val His Pro Phe Ile Glu Arg Pro Ala Leu Leu Gln Ser Ser Gly
                610                 615                 620

Leu Pro Pro Ala Ala Arg Leu Ser Thr Arg Gly Leu Ser Xaa Asp
                625                 630                 635

Val Gln Gly Pro Arg Pro Ala Gly Thr Ala Ser Pro Asn Leu Gly
                640                 645                 650

Pro Trp Lys Ser Pro Pro Pro His His Xaa Ser Ala Leu Thr Leu
                655                 660                 665

Gly Phe His Gly Pro His Ser Thr Ala Ser Pro Thr Xaa Ala Cys
                670                 675                 680

Asp Leu Gly Thr Lys Gly Gly Val Pro Arg Leu Leu Xaa Leu Ser
                685                 690                 695

Arg Gly Ser Gly His Val Gln Gly Gly Ala Gly Trp Pro Gly Gly
                700                 705                 710

Ser Ala His Pro Pro Ala Phe Pro Gln Gly Val Leu Arg Ser Lys
                715                 720                 725

Ile Leu Glu Gln Ser Asp Pro Ser Pro Lys Ala Leu Leu Ser Ala
                730                 735                 740

Gly Gln Cys Gln Pro Ile Pro Gly His Leu Ala Pro Gly Asp Val
                745                 750                 755

Gly Pro Xaa